REMARKS 

The present invention relates to novel nucleic acids encoding a eukaryotic 
Survival of Motor Neuron-Interacting Protein 1 (SIP-1), and proteins encoded thereby. 

The instant application, as being filed herewith, is a divisional application of U.S. 
Patent Application No. 09/399,081, filed September 17, 1999 (the "parent application"), now 
allowed, which claims priority pursuant to 35 U.S.C. § 119(e) to U.S. Provisional Application 
No. 60/100,866 ("provisional application '866"), filed on September 17, 1998. 

This application is being filed to prosecute an invention that was not elected in 
the parent application. Further, the amendments made herein relate in essence to formal matters 
addressed previously in the parent application as more fully set forth below. 

Claims 

By restriction requirement mailed on October 24, 2000 (Paper No. 8) in the parent 
application, original claims 1-101 were restricted into thirty-two (32) groups. By response filed 
November 22, 2000, Applicants elected the claims of Group I for prosecution in the parent 
application. 

The parent application having been allowed, Applicants now elect the claims of 
Group III as restricted in the parent application, for prosecution in the present divisional 
application. 

Further, Applicants have canceled, without prejudice, original claims 1-7 and 14-101, 
which are drawn either to non-elected inventions or are now allowed. Applicants reserve 
the right to pursue the subject matter of the canceled claims in any later continuation and/or 
divisional application(s). Applicants have also added new claims 102-107. Accordingly, upon 
entry of the present Amendment, claims 8-13 and 102-107 are pending in the instant application. 

Original claims 8-13 have now been amended such that they recite that the 
nucleic acid comprises a nucleic acid encoding a tag polypeptide. That is, claim 8 has been 
rewritten to be in independent form since it depended from claim 1, which is now allowed. Further, 
claims 10-12 have been amended herein to depend from claim 8. Since claims 10-12 recite the 
subject matter of original claims 10-12 which now depend from claim 8, and since claim 8 has 
merely been amended to be in independent form, the amendments to claims 8 and 10-12 do not 
add any new matter. Claim 13 has also been amended to recite a recombinant pre-B lymphoid 
DT40 cell comprising the nucleic acid. Support for claim 13 is found in the specification as filed 
commencing on line 24 of page 48. Further, claims 8-13 are supported by the specification as 
filed commencing at page 5, and no new matter has been added by way of these amendments. 

Additionally, new claims 102-107, as added herein, are supported by the 
specification as filed and no new matter has been added by the addition of these claims. More 
specifically, these new claims recite essentially the same subject matter as claims 14-17 as 
originally filed. That is, these claims recite a nucleic acid encoding a tag polypeptide covalently 
linked thereto and cells and vectors comprising the said nucleic acid. Support for new claims 
102-106, reciting a vector and a cell, is found in the specification as filed beginning on line 20 of 
page 45. Support for claim 107 is found in the specification as filed beginning on line 11 of page 
35. Thus, newly added claims 102-107 do not add any new matter. 

Substitute specification 

A substitute specification is being filed herewith; an identical substitute 
specification was filed in the parent application. Briefly, during prosecution of the parent 
application, the Examiner requested that Applicants amend the disclosure to include material 
previously incorporated by reference to the provisional application (i.e., Appln. No. 60/100,866). 
Accordingly, in a good faith effort to expedite prosecution of the parent application, Applicants 
filed a substitute specification accompanied by a declaration stating that the material in the 
substitute specification was originally set forth in the '866 provisional application, and that no 
new matter had been added to the parent application. The amendment submitting the substitute 
specification was duly entered into the parent application and the claims are now allowed. 

Applicants, in an effort to expedite prosecution of this divisional application, 
submit herewith a copy of the substitute specification (excluding claims) and the declaration 
which were submitted in the parent application. Thus, the substitute specification submitted 
herewith is identical to the substitute specification filed in the parent application. That is, the 
substitute specification consists of the original specification as filed in the parent application and 
sets forth an additional example designated Example 9 (which was Example 3 in the '866 
provisional application), beginning at page 164, line 10, of the substitute specification, and 
concluding at page 172, line 27. The substitute specification further comprises the Brief 
Description of the Drawings describing additional Figures 39 through 52, beginning at page 35, 
line 25 and concluding at page 39, line 8 (which were previously numbered Figure 10A through 
Figure 18 in the '866 provisional application) and Figures 39 through 52 themselves (previously 
numbered Figure 10A through Figure 18 in the '866 provisional application). Thus, the Figures 
39-52 and their descriptions are merely the drawings and descriptions thereof as originally set 
forth in the '866 application, now being explicitly incorporated into the present application. 

The present amendments to the specification do not include any new matter as 
they merely insert material previously set forth in the '866 provisional application, from which 
the present application claims priority and which had been previously recited as incorporated by 
reference in the parent application. The present amendments only change the numbering of 
Example 3 (now Example 9) and Figures 10A through 18 (now Figures 39-52) of the '866 
provisional application to preserve the continuity of the present application into which they have 
been explicitly inserted. Therefore, these amendments to the specification and drawings include 
no new matter since all of this material was set forth in the '866 provisional application and was 
duly incorporated by reference in the parent application claiming priority thereto. 

Applicants also submit herewith a copy of a declaration, pursuant to 37 C.F.R. 
1.125, stating that the substitute specification adds no new matter, which declaration was filed in 
the parent application. 

Drawings 

In the prosecution of the parent application, U.S. Application No. 09/399,081, the 
Examiner objected to the drawings. Applicants, in a good faith effort to expedite the prosecution 
of the present application, hereby submit revised formal drawings. These drawings were also 
submitted in the parent application, which is now allowed, and Applicants respectfully assert that 
the fifty-six (56) sheets of drawings, Figures 1 through 52, add no new matter since they merely 
correct informalities in the drawings as originally filed in the parent application and also 
incorporate the drawings set forth in U.S. Provisional Application No. 60/100,866, as was done 
in the parent application. Applicants respectfully submit that the formal drawings being filed 
herewith correct any informalities in the original drawings filed in the parent application and 
comply fully with the provisions of 37 CFR 1.84. 






Summary 

Applicants respectfully submit that claims 8-13 and 102-107 are in condition for 
allowance. Consideration and allowance of each of these claims are respectfully requested at the 
earliest possible date. 

Respectfully submitted, 
GIDEON DREYFUSS et al. 

Date: [handwritten, illegible]    By: [signature, illegible] 

RAQUEL M. ALVAREZ, Ph.D., J.D. 

Registration No. 45,807 

MORGAN, LEWIS & BOCKIUS, L.L.P. 

1701 Market Street 

Philadelphia, PA 19103-2921 

Telephone: (215) 963-5000 

Direct Dial: (215) 963-5403 

Facsimile: (215) 963-5001 

E-Mail: ralvarez@morganlewis.com 

Attorney for Applicants 



RMA/QDN 

Enclosures: Transmittal of Formal Drawings 

Revised fifty-six (56) sheets of formal drawings, Figures 1-52 
Copy of the specification and original drawings as filed in Ser. No. 09/399,081 
Clean and Marked-up copies of the substitute specification (minus claims) 
Copy of Declaration pursuant to 37 C.F.R. 1.125 



1865607_5.DOC 






209596.0292/9596-98U1 
(L-1906) 

COMPOSITIONS, METHODS, AND KITS USEFUL FOR THE 
DIAGNOSIS AND TREATMENT OF SPINAL MUSCULAR ATROPHY 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application claims priority pursuant to 35 U.S.C. § 119(e) to U.S. 
Provisional Application No. 60/100,866, filed on September 17, 1998. 

STATEMENT OF FEDERALLY SPONSORED RESEARCH AND 
DEVELOPMENT 
This invention was supported in part by funds from the U.S. 
Government (National Institutes of Health Grant No. R01-GM37125) and the U.S. 
Government may therefore have certain rights in the invention. 

BACKGROUND OF THE INVENTION 
Spinal muscular atrophy (SMA) is characterized by degeneration of the 
anterior horn cells of the spinal cord, leading to progressive symmetrical limb and 
trunk paralysis and muscular atrophy. SMA is the second most common fatal 
autosomal recessive disorder, second only to cystic fibrosis, and the most common 
genetic cause of childhood mortality, affecting 1 in 6,000 newborns (Roberts et al., 
1970, Arch. Dis. Child. 45:33-38; Pearn, 1973, J. Med. Genet. 10:260-265; Pearn, 
1978, J. Med. Genet. 15:409-413; Czeizel and Hamular, 1989, J. Med. Genet. 21:761-763). 
Childhood spinal muscular atrophies are divided into severe (type I, Werdnig-Hoffman 
disease) and mild forms (type II and III) according to the age of onset and the 
severity of the disease (Munsat, 1991, Neuromusc. Disord. 1:81; Crawford and Pardo, 
1996, Neurobiol. Dis. 3:97-110). The Survival of Motor Neurons (SMN) gene 
(Lefebvre et al., 1995, Cell 89:155-165) has been shown to be the SMA disease gene, 
and it is deleted or mutated in over 98% of SMA patients (Bussaglia et al., 1995, Nat. 
Genet. 11:335-337; Chang et al., 1995, Am. J. Hum. Genet. 57:1503-1505; Cobben et 
al., 1995, Am. J. Hum. Genet. 57:805-808; Hahnen et al., 1995, Hum. Mol. Genet. 
4:1927-1933; Hahnen et al., 1996, Am. J. Hum. Genet. 59:1057-1065; Lefebvre et al., 
1995, Cell 89:155-165; Rodrigues et al., 1995, Hum. Mol. Genet. 4:631-634; Velasco 
et al., 1996, Hum. Mol. Genet. 5:257-263; Lefebvre et al., 1997, Nat. Genet. 16:265-269). 

Two inverted gene copies of the SMN gene are located in a 500 kb 
inverted repeat at chromosome 5q13. In over 98% of all SMA patients, the telomeric 
copy of SMN (SMN^T) is deleted or mutated while the centromeric copy of the gene 
(SMN^C) is unaffected (Lefebvre et al., 1995, Cell 89:155-165). 
The SMN gene encodes a protein of about 296 amino acids having a molecular mass of 
approximately 40 kDa. The sequence of the protein does not exhibit any significant 
homology to any other protein of known function in the currently available protein 
databases. 

Recently, in the course of studies of the functions of heterogeneous 
nuclear ribonucleoproteins (hnRNPs) (Dreyfuss et al., 1993, Ann. Rev. Biochem. 
62:289-321), it was found that the SMN protein interacts with fibrillarin, an RNA-
binding protein involved in rRNA processing, and with several other RNA-binding 
proteins (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565). Monoclonal antibodies to 
SMN localized the protein to a unique cellular location. SMN exhibits a general 
localization in the cytoplasm and is particularly concentrated in several prominent 
nuclear bodies called gems (for gemini of coiled bodies). Gems are novel nuclear 
structures which are related in number and size to coiled bodies and are usually found 
in close proximity to them (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565). Coiled 
bodies, which were first described by Ramon y Cajal (1903, Trab. Lab. Invest. Biol. 
2:129-221), are prominent nuclear bodies found in widely divergent organisms, 
including plant and animal cells (Bohmann et al., 1995, J. Cell Sci. 19:107-113; Gall et 
al., 1995, Dev. Genet. 16:25-35). Coiled bodies contain the spliceosomal U1, U2, 
U4/U6, and U5 snRNPs, U3 snoRNAs, and several proteins, including the specific 
marker p80-coilin, fibrillarin, and NOP140 (Bohmann et al., 1995, J. Cell Sci. 19:107-113, 
and references therein; Gall et al., 1995, Dev. Genet. 16:25-35). Expression of 
p80-coilin mutants and microscopic observations suggest a close association between 
coiled bodies and the nucleolus (Raska et al., 1990, J. Struct. Biol. 104:120-127; 
Andrade et al., 1991, J. Exp. Med. 173:1407-1419; Bohmann et al., 1995, J. Cell Biol. 
131:817-831). However, the specific functions of coiled bodies are not clear. Current 
ideas propose that coiled bodies may be involved in processing, sorting, and assembly 
of snRNAs and snoRNAs in the nucleus. The close association of gems and coiled 
bodies raises the possibility that the SMN protein and gems are also involved in the 
processing and metabolism of small nuclear RNAs (Liu and Dreyfuss, 1996, EMBO J. 
15:3555-3565). 

The Sm class of small nuclear ribonucleoproteins (snRNPs) U1, U2, 
U4/6, and U5 are major constituents of the spliceosome, the catalytic center of the 
pre-mRNA splicing reaction (Moore et al., 1993, In: The RNA World, pp. 303-358, 
Gesteland and Atkins, eds., Cold Spring Harbor Laboratory Press, Plainview, NY; 
Madhani and Guthrie, 1994, Annu. Rev. Genet. 28:1-26). Each spliceosomal snRNP 
consists of one (U1, U2, and U5) or two (U4/6) snRNAs, a common set of at least eight 
Sm proteins, termed B, B', D1, D2, D3, E, F, and G, and specific polypeptides that are 
associated with only one individual U snRNP (reviewed by Lührmann et al., 1990, 
Biochim. Biophys. Acta Gene Struct. Express. 1087:265-292). With the exception of 
U6, all spliceosomal snRNAs share two structural features: the 5'-terminal 
trimethylguanosine (m3G) cap and a short, single-stranded, eight-to-ten nucleotide 
uridine-rich sequence flanked by two hairpin loops, referred to as the Sm site (Branlant 
et al., 1982, EMBO J. 1:1259-1265; Reddy and Busch, 1988, In: Structure and 
Function of Major and Minor Small Nuclear Ribonucleoprotein Particles, pp. 1-37, 
Birnstiel, ed., Springer-Verlag, Berlin). The Sm site is the primary binding site for the 
Sm proteins. The remaining snRNA domains provide binding sites for the snRNA-
specific snRNP proteins and for RNA-RNA interactions (Lührmann et al., 1990, 
Biochim. Biophys. Acta Gene Struct. Express. 1087:265-292). U6 differs from the 
other spliceosomal U snRNAs in that it contains a γ-monomethyl cap instead of the 
m3G cap and does not bind directly to Sm proteins due to its lack of an Sm site 
(Reddy and Busch, 1988, supra; Singh and Reddy, 1989, Proc. Natl. Acad. Sci. USA 
86:8280-8283). The snRNP-specific proteins have snRNP-specific functions in the 
splicing reaction. In contrast, the only known function for the Sm proteins is in the 
biogenesis of U snRNPs. 

The biogenesis of snRNPs, which is illustrated in Figure 26 herein, is a 
complex, multistep process (DeRobertis, 1983, Cell 32:1021-1025; Fisher et al., 1985, 
Cell 42:751-758; Mattaj, 1988, In: Structure and Function of Major and Minor Small 
Nuclear Ribonucleoprotein Particles, pp. 100-114, Birnstiel, ed., Springer-Verlag, 
Berlin; Feeney et al., 1989, J. Biol. Chem. 264:5776-5783; Neuman de Vegvar and 
Dahlberg, 1990, Mol. Cell. Biol. 10:3365-3375; Zieve and Sauterer, 1990, Crit. Rev. 
Biochem. Mol. Biol. 25:1-46). Spliceosomal snRNAs that contain the Sm site are first 
exported to the cytoplasm, where they associate with the Sm proteins (B, B', D1, D2, 
D3, E, F, and G) (Mattaj and DeRobertis, 1985, Cell 40:111-118). Next, in a reaction 
that requires the assembled Sm core domain (comprising the Sm proteins bound to the 
Sm site), the 7-methylguanosine (m7G) cap of the snRNAs is hypermethylated to yield 
2,2,7-trimethylguanosine (m3G) (Mattaj, 1986, Cell 46:905-911). In addition, varying 
numbers of nucleotides are trimmed from the 3' end of several of the snRNAs. Proper 
Sm core assembly, cap hypermethylation, and 3'-end processing are important for 
nuclear import of the assembled snRNP particles (Fischer and Lührmann, 1990, 
Science 249:786-790; Hamm et al., 1990, Cell 62:569-577). Finally, just before or 
after the nuclear import, many internal nucleotides are modified and more than 30 
snRNP-specific proteins associate with the individual snRNP precursors to complete 
their biogenesis (Mattaj, 1988, In: Structure and Function of Major and Minor Small 
Nuclear Ribonucleoprotein Particles, pp. 100-114, Birnstiel, ed., Springer-Verlag, 
Berlin; Lührmann et al., 1990, Biochim. Biophys. Acta Gene Struct. Express. 
1087:265-292; Neuman de Vegvar and Dahlberg, 1990, Mol. Cell. Biol. 10:3365-3375; 
Zieve and Sauterer, 1990, Crit. Rev. Biochem. Mol. Biol. 25:1-46). However, the 
detailed mechanism of how the Sm core proteins and the snRNP-specific proteins form 
functional assembled snRNPs is not clear. 



There is, to date, no effective treatment for SMA and the mechanism underlying the 
disease process is poorly understood. Thus, there is an acute and long-felt need to 
understand the mechanism of the disease process and, more importantly, for the 
development of methods of treating this common and usually fatal disease. The 
present invention addresses these needs. 

SUMMARY OF THE INVENTION 
The invention relates to an isolated nucleic acid encoding a eukaryotic 
SIP1, and any mutants, derivatives, variants, and fragments thereof. 

In one aspect, the isolated nucleic acid shares at least about 20% 
homology with at least one of huSIP1 (SEQ ID NO:1) and XeSIP1 (SEQ ID NO:3). 

In another aspect, the isolated nucleic acid is selected from the group 
consisting of (SEQ ID NO:1), and (SEQ ID NO:3). 

The invention further relates to an isolated nucleic acid encoding a 
eukaryotic SIP1, wherein the SIP1 shares at least about 20% homology with at least 
one of huSIP1 (SEQ ID NO:2), and XeSIP1 (SEQ ID NO:4), and any mutants, 
derivatives, variants, and fragments thereof. 

Also included in the invention is an isolated polypeptide comprising a 
eukaryotic SIP1, and any mutants, derivatives, variants, and fragments thereof. 

In one aspect, the SIP1 shares at least about 20% homology with at least 
one of SEQ ID NO:2 and SEQ ID NO:4. 

In another aspect, the amino acid sequence of the SIP1 is at least one of 
SEQ ID NO:2 and SEQ ID NO:4. 

In another aspect, the nucleic acid further comprises a nucleic acid 
encoding a tag polypeptide covalently linked thereto. 

In one aspect, the tag polypeptide is selected from the group consisting 
of a myc tag polypeptide, a myc-pyruvate kinase tag polypeptide, a His6 tag 
polypeptide, an influenza virus hemagglutinin tag polypeptide, a maltose binding 
protein tag polypeptide, and a glutathione-S-transferase tag polypeptide. 



In another aspect, the nucleic acid of the invention further comprises a 
nucleic acid encoding a promoter/regulatory sequence operably linked thereto. 

In yet another aspect, the nucleic acid further comprises a nucleic acid encoding a 
tag polypeptide. 

The invention also includes a cell comprising the nucleic acid of the 
invention. 

In one embodiment, the cell is a DT40 cell. 

Also included is a vector comprising the isolated nucleic acid of the 
invention. 

In one aspect, the vector further comprises a nucleic acid encoding a 
promoter/regulatory sequence operably linked thereto. 

Also included is a recombinant cell comprising the isolated nucleic acid 
of the invention. 

In one aspect, the recombinant cell comprises the aforementioned 
vector. 

In addition, the invention relates to an antisense isolated nucleic acid 
complementary to the nucleic acid of the invention. 

Further, the invention relates to a cell comprising the antisense nucleic 
acid of the invention. 

In addition, there is included an antibody that specifically binds to a 
eukaryotic SIP1 polypeptide, or a fragment thereof. 

The antibody may be selected from the group consisting of a polyclonal 
antibody, a monoclonal antibody, and a synthetic antibody. 

In a preferred embodiment, the antibody is a monoclonal antibody 
selected from the group consisting of 2S7 and 2E17. 

The invention also relates to an isolated nucleic acid encoding a 
mammalian Gemin3, and any mutants, derivatives, variants, and fragments thereof. 

In one aspect, the nucleic acid shares at least about 20% homology with 
human Gemin3 (SEQ ID NO:7). 

In another aspect, the isolated nucleic acid is SEQ ID NO:7. 



The invention further relates to an isolated nucleic acid encoding a 
mammalian Gemin3, wherein the Gemin3 shares at least about 20% homology with 
human Gemin3 (SEQ ID NO:8), and any mutants, derivatives, variants, and fragments 
thereof. 

In addition, there is included an isolated polypeptide comprising a 
mammalian Gemin3, and any mutants, derivatives, variants, and fragments thereof. 

In one aspect, the Gemin3 shares at least about 20% homology with 
SEQ ID NO:8. 

In another aspect, the Gemin3 is SEQ ID NO:8. 

In another aspect, the nucleic acid further comprises a nucleic acid 
encoding a tag polypeptide covalently linked thereto. 

In one embodiment, the tag polypeptide is selected from the group 
consisting of a myc tag polypeptide, a myc-pyruvate kinase tag polypeptide, a His6 tag 
polypeptide, an influenza virus hemagglutinin tag polypeptide, a maltose binding 
protein tag polypeptide, and a glutathione-S-transferase tag polypeptide. 

In another embodiment, the nucleic acid further comprises a nucleic 
acid encoding a promoter/regulatory sequence operably linked thereto. 

Also included is a vector comprising the just-mentioned nucleic acid. 

The vector may further comprise a nucleic acid encoding a 
promoter/regulatory sequence operably linked thereto. 

In addition, the invention includes a recombinant cell comprising the 
just-mentioned nucleic acid. 

The invention also includes a recombinant cell comprising the 
just-mentioned vector. 

The invention further includes an antisense isolated nucleic acid 
complementary to the just-mentioned nucleic acid, and a cell comprising the same. 

In addition, the invention relates to an antibody that specifically binds to 
a mammalian Gemin3 polypeptide, or a fragment thereof. 

In one aspect, the antibody is selected from the group consisting of a 
polyclonal antibody, a monoclonal antibody, and a synthetic antibody. 



In one embodiment, the antibody is a monoclonal antibody selected 
from the group consisting of 11G9 and 12H12. 

Also included is an antibody that specifically binds to a eukaryotic 
Survival of Motor Neurons (SMN) polypeptide, or a fragment thereof. 

In one aspect, the SMN is human SMN and further wherein the antibody 
is monoclonal antibody 2B1. 

In another embodiment, the SMN is chicken SMN. 

In addition, the invention includes an isolated nucleic acid encoding a 
eukaryotic SIP1, and any mutants, derivatives, variants, and fragments thereof, 
wherein the nucleic acid comprises a mutation that affects binding of SIP1 with SMN. 

Further, the invention includes an isolated nucleic acid encoding human 
SMN, wherein the nucleic acid comprises a mutation which mutation affects binding of 
SMN with at least one of another SMN protein, a Gemin3 protein, and an SIP1 protein. 

In addition, the invention includes an isolated nucleic acid encoding the 
human SMN protein, wherein the nucleic acid comprises a mutation which mutation 
affects pre-mRNA splicing. 

The invention further includes a cell comprising the just-mentioned 
nucleic acid. 

The invention also includes an isolated nucleic acid encoding a 
eukaryotic SIP1, and any mutants, derivatives, variants, and fragments thereof, wherein 
the nucleic acid comprises a mutation which mutation affects binding of SIP1 with 
SMN. 

The invention also includes a fusion protein comprising a tag 
polypeptide and at least a portion of an SMN protein. 

In one aspect, the tag polypeptide is selected from the group consisting 
of a myc tag polypeptide, a myc-pyruvate kinase tag polypeptide, a His6 tag 
polypeptide, an influenza virus hemagglutinin tag polypeptide, a maltose binding tag 
polypeptide, and a glutathione-S-transferase tag polypeptide. 

The invention further includes a fusion protein comprising a tag 
polypeptide and at least a portion of an SIP1 protein. 



In addition, the invention includes a fusion protein comprising a tag 
polypeptide and at least a portion of a Gemin3 protein. 

The invention also includes a vector comprising a nucleic acid encoding 
human SMN wherein the nucleic acid comprises a mutation and further wherein the 
mutation affects SMN binding to at least one of another SMN protein, a Gemin3 
protein, and an SIP1 protein. 

The invention also includes a composition comprising an isolated 
purified SMN protein and a protein that binds specifically with SMN. 

In one aspect, the protein that binds specifically with SMN is selected 
from at least one of another SMN protein, an SIP1 protein, a Gemin3 protein, and an 
Sm protein. 

In one embodiment, the composition further comprises a ribonucleic 
acid. 

The invention further relates to a method of stimulating snRNP 
assembly. The method comprises contacting an extract comprising snRNP components 
with SMN, thereby stimulating snRNP assembly. 

Also included is a mammalian cell comprising an exogenous SMN 
modulating sequence selected from the group consisting of a nucleic acid encoding 
SMN, an antisense nucleic acid complementary to a nucleic acid encoding SMN, and a 
ribozyme specific for ribonucleic acid encoding SMN, wherein the cell exhibits higher 
or lower levels of SMN protein compared with an otherwise identical cell which does 
not comprise the exogenous SMN modulating sequence. 

In one embodiment, the exogenous SMN modulating sequence is an 
antisense nucleic acid and further wherein the cell exhibits a lower level of SMN 
protein compared with an otherwise identical cell which does not comprise the 
antisense nucleic acid. 

In another embodiment, the cell further exhibits an altered growth 
characteristic compared with an otherwise identical cell which does not comprise the 
antisense nucleic acid. 



In yet another embodiment, the exogenous SMN modulating sequence 
is a nucleic acid encoding SMN wherein the nucleic acid encoding SMN is covalently 
linked to a nucleic acid encoding a HA tag polypeptide, and further wherein expression 
of the exogenous SMN modulating sequence inhibits expression of endogenous SMN. 

Also included is a method of identifying a compound which affects the 
level of SMN expression in a cell. The method comprises contacting the cell with a 
test compound and comparing the level of SMN expression in the cell with the level of 
SMN expression in an otherwise identical cell which is not contacted with the test 
compound, wherein a higher or lower level of SMN expression in the cell contacted 
with the compound compared with the level of SMN expression in the otherwise 
identical cell which is not contacted with the compound is an indication that the 
compound affects the level of SMN protein in the cell. 

In one aspect, the compound increases the level of SMN expression in a 
cell. 

In another aspect, the cell is obtained from a SMA type I patient. 

In a further aspect, the cell is selected from the group consisting of a 
fibroblast and a lymphoblastoid cell. 

In addition, the invention includes a method of identifying a test 
compound which is a candidate SMA therapeutic. The method comprises contacting a 
cell with a test compound and comparing the level of SMN expression in the cell with 
the level of SMN expression in an otherwise identical cell which is not contacted with 
the test compound, wherein a lower level of SMN expression in the cell contacted with 
the test compound compared with the level of SMN expression in the otherwise 
identical cell which is not contacted with the test compound is an indication that the 
test compound is a candidate SMA therapeutic, thereby identifying a compound which 
is a candidate SMA therapeutic. 

In one aspect, the cell is obtained from a SMA type I patient. 

In another aspect, the cell is selected from the group consisting of a 
fibroblast and a lymphoblastoid cell. 



The invention further includes a method of identifying a compound 
which affects the level of SMN expression in a cell comprising an SMN modulating 
sequence. The method comprises contacting the cell with a test compound and 
comparing the level of SMN expression in the cell with the level of SMN expression in 
an otherwise identical cell which is not contacted with the test compound, wherein a 
higher or lower level of SMN expression in the cell contacted with the compound 
compared with the level of SMN expression in the cell which is not contacted with the 
compound is an indication that the compound affects the level of SMN expression in 
the cell. 

In one aspect, the SMN modulating sequence is selected from the group 
consisting of an isolated nucleic acid encoding SMN, an antisense nucleic acid 
complementary to a nucleic acid encoding SMN, and a ribozyme specific for 
ribonucleic acid encoding SMN. 

In another aspect, the SMN modulating sequence is an antisense nucleic 
acid complementary to a nucleic acid encoding SMN. 

Also included in the invention is a method of identifying a compound 
useful for the treatment of SMA. The method comprises contacting a cell comprising 
an antisense nucleic acid complementary to a nucleic acid encoding SMN with a test 
compound and comparing the level of SMN expression in the cell with the level of 
SMN expression in an otherwise identical cell which is not contacted with the test 
compound, wherein a higher level of SMN expression in the cell contacted with the 
compound compared with the level of SMN expression in the cell which is not 
contacted with the compound is an indication that the compound is useful to treat 
SMA, thereby identifying a compound useful for the treatment of SMA. 

In addition, the invention relates to a method of assessing whether a test 
compound affects binding of SMN with a protein that specifically binds with SMN. 
The method comprises (a) making a first preparation comprising a surface having at 
least a portion of SMN bound thereon, the test compound, and a labeled protein that 
specifically binds with SMN; (b) assessing the amount of the labeled protein bound 
with the surface in the first preparation; and (c) comparing the amount of the labeled 
protein bound with the surface in the first preparation and the amount of labeled 
protein bound with the surface in an otherwise identical preparation to which the test 
compound is not added, whereby a difference between the amount of labeled protein 
bound with the surface in the first preparation and in the otherwise identical 
5 preparation is an indication that the test compound affects the binding of SMN with a 
protein that specifically binds with SMN. 

In one aspect, the protein that specifically binds with SMN is selected
from the group consisting of another SMN protein, a SIP1 protein, a Gemin3 protein, a
SmB protein, a SmB' protein, a SmD1 protein, a SmD2 protein, and a SmD3 protein.

The invention further relates to a method of assessing whether a test
compound is useful for treatment of SMA. The method comprises (a) making a first
preparation comprising a surface having at least a portion of SMN bound thereon, the
test compound, and a labeled protein that specifically binds with SMN; (b) assessing
the amount of the labeled protein bound with the surface in the first preparation; and
(c) comparing the amount of the labeled protein bound with the surface in the first
preparation and the amount of labeled protein bound with the surface in an otherwise
identical preparation to which the test compound is not added, whereby a lower amount
of the labeled protein bound with the surface in the first preparation than in the
otherwise identical preparation is an indication that the test compound is useful for
treatment of SMA.

In one aspect, the protein that specifically binds with SMN is selected
from the group consisting of another SMN protein, a SIP1 protein, a Gemin3 protein, a
SmB protein, a SmB' protein, a SmD1 protein, a SmD2 protein, and a SmD3 protein.

Also included is a method of enhancing splicing of mRNA. The method
comprises incubating an in vitro pre-mRNA processing extract in the presence of
SMN, or any mutant, derivative, variant, and fragment thereof, thereby enhancing
splicing of the mRNA.

In addition, the invention includes a method of identifying a compound 
that affects pre-mRNA splicing. The method comprises incubating an extract capable
of pre-mRNA splicing in the presence or absence of a test compound and comparing
the level of pre-mRNA splicing in the extract in the presence of the test compound with
the level of splicing of pre-mRNA in the absence of the test compound, wherein a
higher or a lower level of pre-mRNA splicing in the extract in the presence of the test
compound, compared with the level of pre-mRNA splicing in the extract in the absence
of the test compound, is an indication that the test compound affects pre-mRNA
splicing. 

In addition, the invention relates to a method of identifying a test 
compound that is useful to treat SMA. The method comprises incubating an extract 
capable of pre-mRNA splicing in the presence or absence of a test compound and
comparing the level of pre-mRNA splicing in the extract in the presence of the test
compound with the level of splicing of pre-mRNA in the absence of the test compound,
wherein a higher level of pre-mRNA splicing in the extract in the presence of the test
compound, compared with the level of pre-mRNA splicing in the extract in the absence 
of the test compound, is an indication that the test compound is useful to treat SMA. 

The invention further relates to a method of identifying a compound that
affects snRNP assembly. The method comprises incubating an extract capable of
snRNP assembly in the presence or absence of a test compound and comparing the
level of snRNP assembly in the extract in the presence of the test compound with the
level of snRNP assembly in the absence of the test compound, wherein a higher or a
lower level of snRNP assembly in the extract in the presence of the test compound,
compared with the level of snRNP assembly in the extract in the absence of the test 
compound, is an indication that the test compound affects snRNP assembly. 

In addition, there is provided a method of identifying a test compound 
that is useful to treat SMA. The method comprises incubating an extract capable of
snRNP assembly in the presence or absence of a test compound and comparing the
level of snRNP assembly in the extract in the presence of the test compound with the
level of snRNP assembly in the absence of the test compound, wherein a higher level
of snRNP assembly in the extract in the presence of the test compound, compared with
the level of snRNP assembly in the extract in the absence of the test compound, is an
indication that the test compound is useful to treat SMA.

The invention further includes a method of assessing the presence or
degree of SMA in a mammal. The method comprises obtaining a biopsy comprising
motor neurons from the mammal and assessing the number and morphology of gems in
the motor neurons, wherein a lower number of gems in the motor neurons, compared
with the number of gems in motor neurons obtained from an otherwise identical
mammal which does not have SMA, is an indication that the mammal has SMA, and
further wherein the absence of or the presence of a minimal number of gems in the
mammal having SMA is directly related to the severity of the SMA in the mammal.

There is further provided in the invention a method of assessing the
presence or degree of SMA in a mammal. The method comprises comparing the level
of binding of SMN obtained from the mammal to a protein that specifically binds with
SMN with the level of binding of SMN wild type to an identical protein that
specifically binds with SMN, wherein a lower level of binding of the SMN from the
mammal to the protein that specifically binds with SMN compared with the level of
binding of SMN wild type with the identical protein that specifically binds with SMN
is an indication of the presence or degree of SMA in a mammal.

In one aspect, the protein that specifically binds with SMN is selected 
from the group consisting of an SMN protein, an SIP1 protein, and a Gemin3 protein.

The invention additionally includes a knock-out targeting vector, the 
vector comprising a first nucleic acid portion encoding a sequence 5' of the open
reading frame encoding SMN and a second nucleic acid portion encoding a nucleic 
acid sequence 3' of the open reading frame encoding SMN. 

In one aspect, the SMN is chicken SMN (SEQ ID NO:9).

In another aspect, the vector further comprises a nucleic acid encoding a 
selectable marker covalently linked thereto.

In one aspect, the first and second nucleic acid portions flank the nucleic 
acid encoding the selectable marker. 

Also included is a recombinant cell comprising the aforementioned 
knock-out targeting vector. 

The cell may further comprise a vector comprising an isolated nucleic
acid encoding SMN. 

In one embodiment, the cell is a chicken pre-B lymphoid DT40 cell. 

In addition, the invention includes a method of identifying a compound
that affects SMN expression in a cell. The method comprises contacting the
just-mentioned cell with a test compound and comparing the level of SMN expression in
the cell with the level of SMN expression in an otherwise identical cell which is not
contacted with the test compound, wherein a higher or lower level of SMN expression
in the cell contacted with the test compound compared with the level of SMN
expression in the otherwise identical cell which is not contacted with the compound is
an indication that the compound affects SMN expression in a cell, thereby identifying a 
compound that affects SMN expression in a cell. 

There is also provided a method of identifying a compound that is 
useful to treat SMA. The method comprises contacting the aforementioned cell with a 
test compound and comparing the level of SMN expression in the cell with the level of
SMN expression in an otherwise identical cell which is not contacted with the test
compound, wherein a higher level of SMN expression in the cell contacted with the test
compound compared with the level of SMN expression in the otherwise identical cell
which is not contacted with the compound is an indication that the compound increases
SMN expression in a cell, thereby identifying a compound that is useful to treat SMA.

In addition, the invention includes a method of identifying a compound 
useful for the treatment of SMA. The method comprises contacting the 
aforementioned cell with a test compound and comparing the level of growth of the 
cell with the level of growth of an otherwise identical cell which is not contacted with 
the test compound, wherein a higher level of growth of the cell contacted with the
compound compared with the level of growth of the cell which is not contacted with 
the compound is an indication that the compound is useful to treat SMA. 

Also included in the invention is an isolated nucleic acid encoding a
chicken SMN.

In one aspect, the nucleic acid shares at least about 20% homology with
SEQ ID NO:9.

Further included is an isolated nucleic acid encoding chicken SMN, 
wherein the chicken SMN shares at least about 20% homology with SEQ ID NO:10.

In addition, the invention includes an isolated polypeptide comprising
chicken SMN.

In one aspect, the SMN shares at least about 20% homology with SEQ
ID NO:10.

In another aspect, the SMN is SEQ ID NO:10.
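
By way of non-limiting illustration only, one simple way a percentage of this kind might be computed is as percent identity over two sequences that have already been aligned to equal length. This is an assumption made for the sketch; the specification may define homology differently. The function name `percent_identity`, the gap character, and the toy sequences are hypothetical and are not taken from SEQ ID NO:9 or SEQ ID NO:10.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns in which either sequence carries a gap are skipped; identity is
    the share of the remaining columns in which the residues match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for residue_a, residue_b in zip(aligned_a, aligned_b):
        if residue_a == gap or residue_b == gap:
            continue  # ignore gapped columns
        compared += 1
        if residue_a == residue_b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy alignment: 7 ungapped columns, 6 of which match.
identity = percent_identity("MAMSS-GG", "MAMGSAGG")
print(identity >= 20.0)  # True
```

Whether gapped columns count toward the denominator is a design choice; here they are excluded, which is one common convention among several.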



BRIEF DESCRIPTION OF THE DRAWINGS 
The file of this patent contains at least one drawing executed in color. 
Copies of this patent with color drawing(s) will be provided by the Patent and 
Trademark Office upon request and payment of the necessary fee. 

Figure 1 is a diagram depicting the amino acid sequence alignment of
human SIP1 (huSIP1) (also referred to as Gemin2) (SEQ ID NO:2) and of Xenopus
laevis SIP1 (XeSIP1) (SEQ ID NO:4) sequences. Also shown is the amino acid
sequence alignment of SIP1 with the S. cerevisiae Brr1 protein (SEQ ID NO:6). The
boxes indicate identical amino acids, and the borderless gray boxes indicate similar
amino acids.

Figure 2A is an image of an SDS-PAGE gel depicting the interaction of
SIP1 with SMN in vitro. SIP1 interacted with immobilized GST-SMN chimeric
protein in vitro. The interaction was resistant to a 1 M salt (NaCl) wash. Under the
same conditions, SIP1 did not bind with GST alone even at low salt (200 mM NaCl)
concentration.

Figure 2B is an image of a Western blot depicting the fact that SIP1 and
SMN are associated with each other in vivo. Immunoprecipitation of total HeLa extract
was accomplished using monoclonal antibodies 2B1 directed against SMN and 2S7
directed against SIP1. The immunoprecipitated proteins were analyzed by Western
blot using 2S7 and 2B1 (lane 2B1 IP and lane 2S7 IP, respectively).

Figure 2C is an image of a Western blot depicting the formation of a
complex by SIP1 and SMN of approximately 300 kDa or more in the cytoplasm. HeLa
cytoplasmic S100 extract was fractionated on the basis of protein size on a G3000-SW
column. The fractions were analyzed by SDS-PAGE, and the SMN and SIP1 proteins
were detected by Western blot.

Figure 3A is an image depicting the co-localization of SIP1 and SMN
in gems. The image depicts a light microscopic image of an indirect
immunofluorescence experiment on HeLa cells using monoclonal antibody 2S7 against
the SIP1 protein. Note the general cytoplasmic staining and discrete nuclear structures.

Figure 3B is an image depicting a contrast image of the same field as
depicted in Figure 3A.

Figure 3C is an image depicting superimposed laser confocal images of
double-label immunofluorescence microscopy experiments using antibodies against
SIP1 (green) and antibodies against coiled body marker p80-coilin (red).
Co-localization of red and green results in a yellow color.

Figure 3D is an image depicting superimposed laser confocal images of
double-label immunofluorescence microscopy experiments using antibodies against
SIP1 (green) and antibodies against coiled body marker, fibrillarin (red).
Co-localization of red and green results in a yellow color.

Figure 3E is an image depicting superimposed laser confocal images of
double-label immunofluorescence microscopy experiments using antibodies against
SIP1 (green) and anti-Sm antibody Y12 (red). Co-localization of green and red results
in a yellow color.

Figure 3F is an image depicting the superimposed confocal images of
double-label immunofluorescence microscopy experiments using monoclonal antibody
against SIP1 (green) and a rabbit polyclonal antiserum raised against exon 7 of the
human SMN protein (red). Co-localization of green and red results in a yellow color.

Figure 4A is an image of an SDS-PAGE gel depicting the
coimmunoprecipitation of SMN- and SIP1-containing complexes with monoclonal
antibodies against SMN and SIP1. The image demonstrates that the
coimmunoprecipitation of SMN and SIP1 obtained from [35S]methionine-labeled HeLa
total cell extract by monoclonal antibodies 2B1 (lane 2B1) and 2E17 (lane 2E17)
resulted in a similar protein pattern. Control anti-Sm monoclonal antibody Y12
immunoprecipitated Sm proteins and some snRNP-specific proteins (lane Y12), while
negative control antibody SP2/0 illustrates the background of immunoprecipitation
(lane SP2/0).

Figure 4B is an image depicting a two-dimensional non-equilibrium pH
gradient gel electrophoresis (NEPHGE) analysis of SMN-immunoprecipitated complex
(2B1 panel) and the Sm-immunoprecipitated complexes containing core Sm proteins
and some U snRNP-specific proteins from HeLa nuclear fractions immunoprecipitated
with monoclonal antibody Y12 (Y12 panel). The dashed boxes indicate background
proteins that were also detected in control SP2/0 immunoprecipitations.

Figure 5A is an image of a gel depicting the fact that SMN protein
bound directly with Sm B/B', D1-3, and E proteins. Purified snRNP proteins were
analyzed on SDS-PAGE (lane designated "snRNPs") and were transferred to a
nitrocellulose membrane. The membrane was then probed with 2x10^ cpm of in vitro
translated [35S]-labeled SMN protein. After washing away the nonspecific binding
proteins, the membrane was exposed to an X-ray film (lane "[35S]SMN"). The three D
Sm proteins could not be resolved using this gel system. The data disclosed herein
demonstrate that SMN has two distinct binding domains for the Sm proteins and for
SIP1, respectively.

Figure 5B is an image of a gel depicting binding of Sm proteins with
SMN in vitro. [35S]-labeled, in vitro-translated Sm proteins B, D1, D2, D3, E, F, and G
were incubated for 30 minutes at 4°C with GST-SMN immobilized on
glutathione-Sepharose beads. The beads were then washed six times with binding buffer as
described elsewhere herein. Proteins that remained bound to the beads after washing
were eluted by boiling the beads in SDS-PAGE sample buffer and the proteins were
analyzed by SDS-PAGE followed by fluorography.

Figure 5C is an image of a gel depicting the competition for SMN
binding by two polypeptide fragments of SMN which were conjugated to BSA. In
vitro translated [35S]-labeled Sm B and SIP1 proteins were mixed, and binding to
GST-SMN immobilized on glutathione-Sepharose beads was performed as described in
Figure 5B herein, either in the absence (-) or presence of BSA-coupled SMN peptide
fragments corresponding to amino acids 13-44 (P1) or 240-267 (P2) of human SMN
protein or to an unrelated control sequence from HIV-1 Rev NES (ctrl). After washing
the beads, bound proteins were eluted by boiling the beads in SDS-PAGE sample
buffer and the eluted proteins were analyzed by SDS-PAGE followed by fluorography.
The data disclosed herein demonstrate that SMN has two distinct, non-overlapping
binding domains for SIP1 (domain P1) and for the Sm proteins (domain P2),
respectively, as depicted in the illustration at the bottom of this figure.

Figure 6A is an image of a Western blot depicting the presence of SMN
and SIP1 in the cytoplasm of Xenopus oocytes and the association of the proteins with
spliceosomal snRNAs. Proteins obtained from total Xenopus somatic cells (XL-177 cell
line) or from oocytes dissected into nucleus and cytoplasm (O. Nuc. and O. Cyto.,
respectively) were fractionated by SDS-PAGE and analyzed by Western blotting with
anti-SMN antibody (2B1) or anti-SIP1 antibody (2E17).

Figure 6B is an image of a gel depicting the immunoprecipitation of
spliceosomal U snRNAs using anti-SIP1 and anti-SMN antibodies. Selected mixtures
of the indicated in vitro synthesized [32P]-labeled RNAs were injected into the
cytoplasm of oocytes. Three hours later, immunoprecipitations were carried out using
either anti-SMN antibody 2B1 or anti-SIP1 antibody 2E17. Immunoprecipitated RNA
(IP) was analyzed by gel electrophoresis. The supernatants (SUP) of the 2B1
immunoprecipitations are depicted; the supernatants of the 2E17
immunoprecipitations were similar to those of 2B1.

Figure 7 is an image of a gel depicting the fact that anti-SMN and anti-SIP1
antibodies immunoprecipitated U1 and U5 snRNAs obtained only from the
cytoplasm of Xenopus oocytes. A mixture of [32P]-labeled U1 and U5 snRNAs was
injected into the cytoplasm of oocytes. After incubation for 20 hours, the oocytes were
dissected into nuclear (N) and cytoplasmic (C) fractions, as depicted in the illustration
at the top of the figure. RNAs from both fractions were immunoprecipitated (IP) using
either the anti-Sm antibody Y12, anti-SMN antibody 2B1, or anti-SIP1 antibody 2E17.
One-fifth of the total supernatant (SUP) was loaded on the gel. 

Figure 8A is an image of a gel depicting the fact that anti-SIP1
antibodies interfere with the biogenesis cycle of spliceosomal U snRNPs. Oocytes
received a cytoplasmic injection of either water (-), anti-SMN antibody (2B1), anti-Sm
antibody (Y12), or anti-SIP1 antibody (2E17). The same oocytes were injected in the
nucleus 1 hour later with a mixture of [32P]-labeled U1, U5, and U6 snRNA. Fifteen
hours later, the oocytes were dissected into nuclear (N) and cytoplasmic (C) fractions,
and the injected RNAs were analyzed by electrophoresis on an RNA gel.

Figure 8B is an image of a gel depicting inhibition of nuclear import of
all spliceosomal snRNAs by anti-SIP1 antibodies. In vitro transcribed, [32P]-labeled
snRNAs U1, U2, U4, and U5 were injected into the cytoplasm of oocytes either alone
(-) or together with anti-SIP1 antibody (2E17). The oocytes were incubated for an
additional 15 hours and they were then dissected into nuclear (N) and cytoplasmic (C)
fractions. Injected [32P]-labeled RNAs were isolated and analyzed by electrophoresis
on a denaturing RNA gel.

Figure 9A is an image of a gel depicting the fact that anti-SIP1 and
anti-SMN antibodies affected the assembly of the Sm core domain of spliceosomal
snRNPs. Anti-SIP1 antibodies inhibited the assembly of the Sm core domain of all
spliceosomal U snRNPs. Injections into the oocyte cytoplasm of either water (-),
anti-Sm antibody (Y12), anti-SIP1 antibody (2E17), anti-SMN antibody (2B1), or control
antibody (SP2/0) were performed. One hour later, the same oocytes were injected a
second time in the cytoplasm with a mixture of [32P]-labeled U1, U5, and U6 snRNAs.
In the experiment shown in the lane designated "SIP1 post", U1, U5, and U6 snRNAs
were injected 1 hour prior to the injection of anti-SIP1 antibody. After incubation for
an additional 1 hour, the oocytes were homogenized and the RNAs were
immunoprecipitated with anti-Sm antibody Y12 (IP Y12). Immunoprecipitated RNAs
were analyzed by electrophoresis on an RNA gel with the corresponding supernatants
(SUP Y12).

Figure 9B is an image of a gel depicting stimulation of the formation of
the Sm core domain by the anti-SMN monoclonal antibody 2B1. The data disclosed
herein demonstrate stimulation of Sm protein binding to U5 snRNA in the presence of
high concentrations of the anti-SMN antibody 2B1. A mixture of [32P]-labeled U5 and
U6 snRNAs was injected into the cytoplasm of oocytes either with water (-) or with
anti-SMN and anti-SIP1 antibodies (2B1 and 2E17, respectively) at either high (h,
5 µg/µl) or low (l, 1 µg/µl) concentrations. After a 1 hour incubation, the oocytes were
homogenized, and the RNAs were immunoprecipitated with the anti-Sm antibody Y12
(IP Y12). Immunoprecipitated RNAs were analyzed by electrophoresis on an RNA gel
alongside one-tenth of the supernatants (SUP Y12).

Figure 10A is an image depicting the fact that SMN interacts directly
with itself and with SmB, and that these interactions are affected by mutations found in
SMA patients. A binding assay of His-tagged SMN and SmB recombinant proteins (2
mg) with either GST or GST-SMN was performed as described elsewhere herein.
Bound SMN and SmB were analyzed by SDS-PAGE and Western blotting with an
anti-T7 tag antibody. Ten percent of the input is shown in the first lane.

Figure 10B is an image depicting the fact that SMN interacts directly
with itself and with SmB, and that these interactions are affected by mutations found in
SMA patients. In vitro translated [35S]methionine-labeled, myc-tagged SMN wild-type
and mutant proteins were incubated with the indicated purified GST-fusions (SMN,
SmB, or SIP1) as described elsewhere herein. Bound proteins were analyzed by
SDS-PAGE and fluorography. Twenty percent of the input is depicted in the in vitro
translation area. The area corresponding to GST-SmB binding is a 3-fold-longer
exposure than the other areas.

Figure 11A is an image depicting the fact that SMN self-association
specifically increases the binding affinity for Sm proteins. GST or GST-SMN was
preincubated with or without a 4-fold molar excess of His-SMN as described elsewhere
herein. After washing away unbound His-SMN, in vitro translated
[35S]methionine-labeled SMN, SmB, or SIP1 was added and the binding assay was performed as
described elsewhere herein. Bound proteins were analyzed by SDS-PAGE and
fluorography. Ten percent of the input is shown in the in vitro translation area. 

Figure 11B is an image depicting the fact that SMN self-association
specifically increases the binding affinity for Sm proteins. GST-SMN was
preincubated with or without a 4-fold molar excess of His-SMN as described above in
Figure 11A. Then, in vitro translated [35S]methionine-labeled Sm proteins were
added, and the binding assay was performed as described elsewhere herein.
Twenty-five percent of the input is shown in the in vitro translation area.

Figure 12A is an image depicting titration analysis of the effect of SMN
self-association on SmB binding. GST-SMN was preincubated with the indicated
molar excess of His-tagged SMNwt, SMNY272C, or SMNΔEx7. After washing
unbound recombinant proteins, in vitro translated [35S]methionine-labeled SmB was
added and binding was performed as described elsewhere herein. Each binding assay
was analyzed by Western blotting using anti-T7 tag mAb to detect bound His-tagged
SMN proteins (SMN) and by autoradiography to detect bound SmB (lower band
designated Sm B).

Figure 12B is an image depicting titration analysis of the effect of SMN
self-association on SmB binding. GST-SMNY272C was preincubated with the
indicated molar excess of His-SMNY272C and processed further as described
previously elsewhere herein.

Figure 13A is an image depicting the fact that SMN mutants found in SMA
patients are defective in oligomerization and interaction with Sm proteins. SMN wild
type but not SMN mutants of SMA patients form oligomers. Purified recombinant
His-tagged SMN, SMNY272C, SMNΔEx7, and SmB proteins were analyzed
individually by HPLC gel filtration as described elsewhere herein. The fractions
obtained therefrom were analyzed by SDS-PAGE, and the proteins were detected by
Western blotting. The indicated positions of the molecular mass markers were
determined by independent column chromatographies.

Figure 13B is an image depicting the fact that SMN oligomers bind to
SmB. The indicated mixtures of purified recombinant His-tagged SMN or SMNΔEx7
with His-tagged SmB were analyzed by HPLC gel filtration. The fractions were
analyzed by SDS-PAGE, and the proteins were detected by Western blotting. The
indicated positions of the molecular mass markers were determined by independent
column chromatographies.

Figure 14 is an image depicting the reduced association of SMNY272C
and SMNΔEx7 with Sm proteins in vivo. 293T cells were transiently transfected with
the indicated myc-tagged SMN constructs and cell protein extracts were analyzed by
coimmunoprecipitation. Briefly, total cell extracts were immunoprecipitated with
either anti-Sm (Y12) or anti-SIP1 (2E17) monoclonal antibodies (mAbs). Total cell
extracts (10% of the input) and the anti-Sm immunoprecipitates were analyzed by
Western blotting with anti-myc (9E10) and Y12 antibodies; the anti-SIP1
immunoprecipitates were analyzed by Western blotting with 9E10 and 2E17. A
relatively low amount of antibody was used in the immunoprecipitations to improve
the detection of SmB/B' and SIP1 over the light chains of the immunoglobulins, which
migrate very closely. The transfected proteins were overexpressed approximately 5- to
10-fold compared with endogenous SMN. Only SmB is depicted because other Sm
proteins were not detected by Western blotting using Y12 antibody. The transfected
SMNΔEx7 migrated closer to full-length SMN on a 12.5% polyacrylamide
SDS-PAGE.

Figure 15A is a diagram depicting the structure of the SMN protein and
its interacting domain. The amino acid numbers and the borders of exons are
indicated. SIP1-interacting domain resides at the amino terminus of SMN as
determined by competition experiments (see, e.g., Example 1). SMN self-association
and SMN/Sm interaction domains overlap with the conserved YG box at the
carboxyl-terminus of SMN as determined by deletion, mutation, and competition experiments.

Figure 15B is a diagram depicting the interactions of SMN/SIP1 with
Sm proteins. Monomeric SMN, associated with SIP1, which binds to SMN but not to
itself, contains a low-affinity binding site for Sm proteins. SMN self-associates,
forming at least a SMN/SIP1 tetrameric complex. In this oligomeric conformation, a
binding site is formed with a much higher affinity for the Sm proteins. SMN mutations
found in SMA patients result in a reduced ability of SMN to self-associate (Burghes,
1997, Am. J. Hum. Genet. 61:9-15) and also map within the Sm-binding site itself
(Brzustowicz et al., 1990, Nature 344:540-541), thus affecting the SMN interaction
with Sm proteins.

Figure 16A is an image depicting the redistribution of Sm proteins in
cells expressing the SMNΔN27 deletion mutant. The image depicts immunostaining
of HeLa cells transiently transfected with myc-SMN wt using anti-myc tag
monoclonal antibody 9E10 (green).

Figure 16B is an image depicting the immunostaining of HeLa cells
transiently transfected with myc-SMN wt using the anti-Sm monoclonal antibody Y12
(red).

Figure 16C is an image depicting double-label confocal
immunofluorescence combining the images depicted in Figures 16A (anti-myc [green])
and 16B (anti-Sm [Y12] red) of HeLa cells transiently transfected with myc-SMN wt.
Co-localization of the green and red signals results in a yellow signal. The nuclear
gems are indicated by arrows and the dashed line demarcates the nucleus.

Figure 16D is an image depicting the redistribution of Sm proteins in
cells expressing the SMNΔN27 deletion mutant. The image depicts immunostaining
of HeLa cells transiently transfected with myc-SMNΔN27 using anti-myc tag
monoclonal antibody 9E10 (green).

Figure 16E is an image depicting the immunostaining of HeLa cells
transiently transfected with SMNΔN27 using the anti-Sm monoclonal antibody Y12
(red).

Figure 16F is an image of double-label confocal immunofluorescence
combining the images depicted in Figures 16D (anti-myc [green]) and 16E (anti-Sm
[Y12] red) on HeLa cells transiently transfected with myc-SMNΔN27. Co-localization
of the green and red signals results in a yellow signal. The cytoplasmic accumulations
are indicated by arrows and the dashed line demarcates the nucleus.

Figure 17A is an image depicting the distribution of SMNΔN27 in cells
expressing the SMNΔN27 deletion mutant protein. This image depicts the
immunostaining of HeLa cells transiently transfected with myc-SMNΔN27 using
anti-myc tag monoclonal antibody 9E10 (green).

Figure 17B is an image depicting the redistribution of U snRNAs in
cells expressing the SMNΔN27 deletion mutant. This image depicts in situ
hybridization demonstrating U2 snRNA localization using a U2-specific antisense
oligo probe (red).

Figure 17C is an image depicting the co-localization of SMNΔN27 and
U2 snRNAs in cells expressing the SMNΔN27 deletion mutant protein. This image
depicts double-label confocal immunofluorescence using anti-myc tag monoclonal
antibody 9E10 (green) and U2-specific antisense oligo probe (red). Co-localization
results in a yellow signal. The cytoplasmic accumulations are indicated by arrows and
the dashed line demarcates the nucleus.

Figure 17D is an image depicting the distribution of SMNΔN27 in cells
expressing the SMNΔN27 deletion mutant protein. This image depicts the
immunostaining of HeLa cells transiently transfected with myc-SMNΔN27 using an
anti-myc tag rabbit affinity-purified polyclonal antibody against the 9E10 epitope
(green).

Figure 17E is an image depicting the cap hypermethylation of snRNAs
in the cytoplasm of cells transiently transfected with SMNΔN27 deletion mutant. This
image depicts the immunostaining of HeLa cells transiently transfected with
myc-SMNΔN27 using anti-TMG cap monoclonal antibody K121 (red).

Figure 17F is an image depicting the co-localization of SMNΔN27 and
TMG capped snRNAs in cells expressing the SMNΔN27 deletion mutant protein. This
image depicts double-label confocal immunofluorescence using an anti-myc tag rabbit
affinity purified polyclonal antibody against the 9E10 epitope (green) and anti-TMG
cap monoclonal antibody K121 (red). Co-localization of green and red signals results
in a yellow signal. The cytoplasmic accumulations are indicated by arrows and the
dashed line demarcates the nucleus.

Figure 18A is an image depicting the distribution of SMN wild type in
cells transiently transfected with myc-SMNwt. HeLa cells transiently transfected with
mycSMNwt were immunostained using anti-myc tag monoclonal antibody 9E10 (green
signal).

Figure 18B is an image depicting the localization of p80-coilin in cells
expressing mycSMNwt tag recombinant protein. This image depicts a HeLa cell
transiently transfected with mycSMN immunostained using anti-p80 coilin rabbit
polyserum R288 (red signal).

Figure 18C is an image depicting double-label confocal
immunofluorescence demonstrating co-localization of SMNwt and coilin in cells
transiently transfected with mycSMN recombinant. HeLa cells transiently transfected
with mycSMNwt were immunostained using anti-myc tag (Figure 18A) and anti-p80
coilin (Figure 18B) and the two images were combined. Co-localization of red and
green signals results in a yellow signal. The nuclear gems are indicated by arrows and
the dashed line demarcates the nucleus.

Figure 18D is an image depicting the distribution of SMNΔN27
recombinant in cells transiently transfected with myc-SMNΔN27. HeLa cells
transiently transfected with myc SMNΔN27 were immunostained using anti-myc tag
monoclonal antibody 9E10 (green signal).

Figure 18E is an image depicting the localization of p80-coilin in cells
expressing myc SMNΔN27 tag recombinant protein. This image depicts a HeLa cell
transiently transfected with mycSMNΔN27 immunostained using anti-p80 coilin rabbit
polyserum R288 (red signal).

Figure 18F is an image depicting double-label confocal 
immunofluorescence demonstrating co-localization of SMNΔN27 and coilin in cells
transiently transfected with mycSMN recombinant. HeLa cells transiently transfected
with mycSMNΔN27 were immunostained using anti-myc tag (Figure 18D) and
anti-p80 coilin (Figure 18E) and the two images were combined. Co-localization of red and
green signals results in a yellow signal. The cytoplasmic accumulations are indicated
by arrows and the dashed line demarcates the nucleus. 

Figure 19A is an image of a gel depicting the effect of SMN wild type
and SMNΔN27 recombinant proteins on pre-mRNA splicing in vitro. This image
depicts in vitro splicing assays in the presence of SMN wild type and SMNΔN27
recombinant proteins. [α-32P]-labeled chicken δ-crystallin pre-mRNA was incubated
for 20 or 40 minutes at 30°C with 50 micrograms of HeLa cell nuclear extracts (13
milligrams per milliliter) in the presence of buffer D (lanes 2 and 3), or 37.5
micrograms per milliliter of recombinant SMN wt (lanes 4 and 5) or 37.5 micrograms
per milliliter of SMNΔN27 (lanes 6 and 7). The RNA products were analyzed by gel
electrophoresis.

Figure 19B is an image of an RNA gel depicting in vitro splicing assays
after pre-incubation with SMN wild type and SMNΔN27 deletion recombinant
proteins. HeLa cell nuclear extracts were first incubated for 20 minutes at 30°C in the
presence of buffer D (lanes 2 and 3), or 37.5 micrograms per milliliter of SMN wt
(lanes 4 and 5) or 37.5 micrograms per milliliter of SMNΔN27 (lanes 6 and 7).
[α-32P]-labeled chicken δ-crystallin pre-mRNAs were added and the reactions were
incubated further at 30°C for the indicated additional times. RNA products were
analyzed by gel electrophoresis. The structures of the splicing products are shown
schematically on the right-hand side of the figure.

Figure 20 is an image depicting a gel demonstrating the effect of SMN
recombinant proteins on spliceosome formation in vitro. Splicing reactions were
carried out using [α-32P]-labeled chicken δ-crystallin pre-mRNA without (lanes 1 and 2)
or with (lanes 3-5) preincubation. Splicing complexes were fractionated using
electrophoresis on a native polyacrylamide gel and the complexes were visualized
using autoradiography as described elsewhere herein.

Figure 21 is an image depicting a gel demonstrating the effect of
pre-incubation with anti-SMN antibodies on pre-mRNA splicing in vitro. Two
micrograms of the following purified antibodies were added to the splicing reaction
mixture: Y12 (anti-Sm), anti-SMN monoclonal (2B1), anti-SMN exon 7 (αEx7), and
control antibody (SP2/0). [α-32P]-labeled chicken δ-crystallin pre-mRNA was added to
the splicing reaction mixture and the mixture was incubated for an additional 20 or 40
minutes at 30°C. The RNA products were analyzed by gel electrophoresis. A
schematic representation of the structure of each radiolabeled RNA product is depicted
along the right-hand side of the figure.

Figure 22 is an image depicting a gel demonstrating the effect of
SMNwt, SMNΔN27, SMNY272C, and SMNΔEx7 recombinant proteins on in vitro
splicing of adenovirus 2 major late pre-mRNA (Ad-2 ΔIVS). The splicing reaction
mixture was preincubated using either buffer D (lane 2) or 20 micrograms per milliliter
of recombinant protein as indicated in lanes 3-6. [α-32P]-labeled adenovirus 2 major
late pre-mRNA was then added to the mixture and the samples were incubated for 40
minutes at 30°C. The RNA products were analyzed by gel electrophoresis. A
schematic representation of the structure of each radiolabeled RNA product is depicted
along the right-hand side of the figure. The inhibition of pre-mRNA cleavage by
SMNΔN27 was not as complete as that disclosed elsewhere herein (e.g.,
Figure 19A) because of the lower amount of recombinant protein used in this
experiment.

Figure 23 is a diagram depicting a model of the role of SMN in
pre-mRNA splicing. Without wishing to be bound by any particular theory, this diagram
depicts the nuclear function(s) of SMN, and of gems and coiled bodies, in recycling
snRNPs after pre-mRNA splicing as discussed more fully elsewhere herein.

Figure 24A is an image of a gel depicting immunoprecipitation of the
SMN complex using anti-SMN monoclonal antibody 2B1 from
[35S]methionine-labeled HeLa cell lysate. The immunoprecipitated proteins were
analyzed by SDS-PAGE and autoradiography (24 hours exposure). Antibody 2B1 (lane 2B1)
immunoprecipitated SMN, Gemin2, Sm proteins B, B', D1-3, E, F and G, and a group
of proteins indicated as p175, p105, p97, p95, p60 and p50. The SP2/0 lane
demonstrates the background of immunoprecipitation.

Figure 24B is an image depicting a longer exposure (36 hours) of the 
bottom part of the gel depicted in Figure 24A. The longer exposure detects Sm 
proteins more clearly than the shorter 24 hour exposure. The position of the molecular 
weight markers is indicated on the left of SP2/0 lane in kilodaltons (kDa). 

Figure 25A is a diagram depicting sequencing of Gemin3 using
nanoelectrospray tandem mass spectrometry. A selected portion of the spectrum of the
unseparated in-gel tryptic digest of the p105 band is depicted. Peptide ions designated by
"T" are autolysis products of trypsin and were identified by comparison with the
spectrum acquired from the control sample. Other peptide ions observed in the
spectrum were, in turn, isolated by the first mass analyzer of a triple quadrupole
instrument and fragmented in the collision cell, and their tandem mass spectra were
acquired. Upon searching a comprehensive protein sequence database using tandem
mass spectrometric data, peptide ions designated with filled triangles were identified as
tryptic peptides originating from PTB-associated splicing factor (PSF) (P23246). The
presence of PSF in the SMN complex was not confirmed by either
co-immunoprecipitation or direct binding to several components of the SMN complex.
Peptide ions designated using asterisks (*) were identified as peptides from
immunoglobulins used for immunoaffinity purification. Tandem mass spectra acquired
from the peptide ions having m/z 552.9 and 622.5 did not identify any protein in the
protein sequence database(s) searched. However, when the search was performed
against a comprehensive database of expressed sequence tags (i.e., dbEST) the peptide
sequence VLISTDLTSR from EST clone W65908 was identified as matching the
tandem mass spectrum. After the full-length sequence had been obtained as described
elsewhere herein, the tandem mass spectrum acquired from the peptide ion at m/z
622.5 was matched to the peptide LNSSDPSLIGLK (SEQ ID NO: 9) present in the
sequence of Gemin3.
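
By way of non-limiting illustration only, the relationship between the two peptide
sequences named above and the m/z values of their doubly charged ions can be sketched
as follows. The sketch assumes standard monoisotopic residue masses and an unmodified
peptide; the function name peptide_mz and the table RESIDUE_MASS are hypothetical and
are not part of the methods disclosed herein. The calculated values fall within a few
tenths of a mass unit of the reported m/z 552.9 and 622.5.

```python
# Monoisotopic residue masses (Da) for the amino acids occurring in the two
# peptides discussed above. Assumption: unmodified residues only.
RESIDUE_MASS = {
    "G": 57.02146, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "K": 128.09496, "R": 156.10111,
}
WATER = 18.01056   # added once per peptide (N-terminal H plus C-terminal OH)
PROTON = 1.00728   # mass of one charge-carrying proton


def peptide_mz(sequence, charge=2):
    """Return the monoisotopic m/z of a peptide ion carrying `charge` protons."""
    neutral_mass = sum(RESIDUE_MASS[residue] for residue in sequence) + WATER
    return (neutral_mass + charge * PROTON) / charge


for peptide in ("VLISTDLTSR", "LNSSDPSLIGLK"):
    print(peptide, round(peptide_mz(peptide), 2))
```

The same function can be called with charge=1 to obtain the singly protonated mass
of either peptide.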

Figure 25B is a diagram depicting the tandem mass spectrum acquired
from the doubly charged peptide precursor ion having m/z 552.9. A continuous series of
fragment ions containing the C-terminus of the peptide (Y''-ions) (Roepstorff and
Fohlman, 1984, Biomed. Mass Spectrom. 11:601) was produced upon collisional
fragmentation of tryptic peptides. A short stretch of the peptide sequence was deduced
unambiguously by considering precise mass differences between adjacent Y''-ions
(designated in bold capital letters) observed in a part of the spectrum above the m/z of
the parent ion. Leucine and isoleucine residues have the same nominal mass and are
usually not distinguishable by mass spectrometry and are therefore indicated as "L/I."
The determined portion of a peptide sequence was combined with the masses of the
corresponding Y''-ions and with the mass of the intact peptide into a peptide sequence tag
as described by Mann and Wilm (1994, Analytical Chemistry 66:4390-4399), which
was subsequently used to search protein and EST databases using the program
PeptideSearch. Once the database search produced a hit, the corresponding peptide
sequence was retrieved from a database and masses of the ions from the N-terminal
fragment series (A- and B-ions) were used to verify the match. This protocol enabled
highly confident protein identification in a case where a single peptide containing ten
amino acid residues was matched to the sequence of a single EST clone.
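
By way of non-limiting illustration only, the reading of a partial sequence from mass
differences between adjacent C-terminal fragment ions can be sketched as follows. The
sketch assumes singly charged fragment ions, standard monoisotopic residue masses, and
an arbitrary matching tolerance of 0.02 mass units; it is not the PeptideSearch
program, and the names y_ions and read_tag are hypothetical. Leucine and isoleucine
are reported together as "L/I" because their masses are identical.

```python
# Monoisotopic residue masses (Da); assumption: unmodified residues only.
RESIDUE_MASS = {
    "G": 57.02146, "S": 87.03203, "P": 97.05276, "V": 99.06841,
    "T": 101.04768, "L": 113.08406, "I": 113.08406, "N": 114.04293,
    "D": 115.02694, "K": 128.09496, "R": 156.10111,
}
WATER = 18.01056
PROTON = 1.00728

# Lookup used when reading a spectrum: L and I collapse into one entry.
READ_MASS = {("L/I" if r in "LI" else r): m for r, m in RESIDUE_MASS.items()}


def y_ions(sequence):
    """Singly charged C-terminal fragment ion m/z values, shortest fragment first."""
    series = []
    running = WATER + PROTON
    for residue in reversed(sequence):
        running += RESIDUE_MASS[residue]
        series.append(running)
    return series


def read_tag(ion_mz, tolerance=0.02):
    """Assign a residue to each mass difference between adjacent fragment ions."""
    tag = []
    for low, high in zip(ion_mz, ion_mz[1:]):
        delta = high - low
        hits = [name for name, mass in READ_MASS.items()
                if abs(mass - delta) <= tolerance]
        tag.append(hits[0] if hits else "?")
    return tag


# Reading from low to high m/z walks from the C-terminus toward the N-terminus.
print(read_tag(y_ions("VLISTDLTSR")))
```

In practice only a short run of well-resolved ions would be read in this way, and the
resulting tag would be combined with the flanking ion masses and the intact peptide
mass before a database search, as described above.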

Figure 26 is a diagram depicting a schematic representation of the
modular structure of Gemin3 and the various domains of Gemin3. This diagram
illustrates that Gemin3 is a DEAD-box containing RNA helicase. Further, the
diagram discloses that Gemin3 contains seven helicase motifs (helicase motifs are
reviewed in De la Cruz et al., 1999, TIBS 24:192-198) designated I, Ia, II, III, IV, V
and VI, which are represented by black boxes with conserved amino acids represented
in white letters within the boxes. Amino acid residues designated by upper case letters
indicate highly conserved residues; lower case letters indicate less conserved residues.
The SMN interacting domain (i.e., amino acids 456 to 547) is boxed in grey.

Figure 27 is a diagram depicting the amino acid sequence alignment of
human Gemin3 (SEQ ID NO:8) and the human DEAD box ATP-dependent RNA
helicase eIF4A-II. The N-terminal half of Gemin3 contains a DEAD box RNA
helicase domain while the C-terminal half does not exhibit homology to any protein in
the database. Amino acids shaded in light grey indicate similar amino acids, and dark
grey indicates identical amino acids. The positions of the seven helicase motifs as well
as the SMN interacting domain are indicated.

Figure 28A is an image depicting a gel demonstrating that monoclonal
antibodies 11G9 and 12H12 are specific for Gemin3. Myc-tagged Gemin3, hnRNP A1
and SMN proteins were produced using a rabbit reticulocyte lysate in the presence of
[35S]methionine. The labeled proteins were immunoprecipitated using monoclonal
antibody 11G9 and 12H12 and the immunoprecipitated material was analyzed using
SDS-PAGE followed by autoradiography. Ten percent (10%) of the total in vitro
translated proteins is depicted in the left panel.

Figure 28B is an image depicting immunoblotting using monoclonal
antibody 11G9 on purified 6His-Gemin2 and 6His-Gemin3. The position of the
molecular weight markers is indicated on the left side of the figure (in kDa).

Figure 28C is an image of a Western blot depicting immunoblotting
using monoclonal antibodies 12H12 and 11G9 on total HeLa cell protein extract. In
addition to the strong p105 signal, 12H12, but not 11G9, produced a weak signal by
binding to a protein of about 55 kDa on Western blots. The position of the molecular
weight markers is indicated on the left side of the figure (in kDa).

Figure 29A is an image depicting the localization of Gemin3 in HeLa
cells. This image depicts a laser confocal image of indirect immunofluorescence on
HeLa cells using monoclonal antibody 12H12 against the Gemin3 protein. The data
disclosed herein demonstrate general cytoplasmic staining, as well as staining of the
nucleoplasm and of discrete nuclear structures.

Figure 29B is an image depicting a DIC image of the same cell depicted
in Figure 29A. The arrows indicate gems.

Figure 29C is an image depicting the fact that Gemin3 co-localizes with
p80 coilin. This image depicts superimposed laser confocal images of double-label
immunofluorescence microscopy experiments using antibodies against the coiled body
marker, p80 coilin (green) and anti-Gemin3 11G9 (red). Co-localization of green and
red signals results in a yellow signal. Dashed lines demarcate the nucleus.

Figure 29D is an image depicting the fact that SMN co-localizes with
p80 coilin. This image depicts superimposed laser confocal images of double-label
immunofluorescence microscopy experiments using antibodies against the coiled body
marker, p80 coilin (green), and anti-SMN antibody 2B1 (red). Co-localization of green
and red signals results in a yellow signal. Dashed lines demarcate the nucleus.

Figure 29E is an image depicting the fact that SMN co-localizes with
Gemin3. This image depicts superimposed laser confocal images of double-label
immunofluorescence microscopy experiments using monoclonal anti-Gemin3 11G9
(red) and a rabbit affinity-purified antibody against Exon 7 of human SMN (green).
Co-localization of green and red signals results in a yellow signal. Dashed lines
demarcate the nucleus.

Figure 30A is an image depicting a gel demonstrating that Gemin3 is in
a complex with SMN, Gemin2 (also referred to as SIP1), and the spliceosomal Sm
proteins. This image depicts immunoprecipitations of [35S]methionine-labeled HeLa
cell lysate using monoclonal antibodies specific to SMN (2B1), Gemin3 (11G9), and the
snRNP core Sm proteins (Y12). The immunoprecipitated proteins were analyzed using
SDS-PAGE followed by autoradiography (24 hours exposure). The
immunoprecipitations were performed in the presence of Empigen BB or Triton X-100
as indicated elsewhere herein. The identity of several pertinent proteins
immunoprecipitated is indicated on the right side of the figure.

Figure 30B is an image depicting a longer exposure (36 hours) of the
bottom portion of the gel depicting immunoprecipitation in the presence of Triton X-100
(i.e., Figure 30A). The longer exposure was performed to visualize the Sm proteins
more clearly.

Figure 30C is an image of a gel depicting the fact that Gemin3, SMN,
Gemin2 and the Sm proteins can be co-immunoprecipitated in vivo. This image
depicts that monoclonal antibodies against Gemin3 co-immunoprecipitate SMN and
Gemin2. Immunoprecipitation using total HeLa protein extract was performed using
monoclonal antibody 11G9 and the immunoprecipitated proteins were analyzed by
Western blot using 2B1 (anti-SMN) or using 2S7 (anti-Gemin2) antibodies.

Figure 30D is an image of a gel depicting the fact that monoclonal
antibodies against SMN, Gemin2, and the Sm proteins co-immunoprecipitate Gemin3.
Immunoprecipitation using total HeLa cell protein extract was performed using
monoclonal antibodies against SMN (lane 2B1 IP), Gemin2 (lane 2S7 IP), or the Sm
proteins (lane Y12 IP). The immunoprecipitated proteins (IP) were analyzed by
Western blot using the anti-Gemin3 monoclonal antibody 12H12. The position of the
molecular weight markers is indicated on the left side of the figure (in kDa). The
positions of the light chain (l.c.) and heavy chain (h.c.) of the antibodies used for
immunoprecipitation are indicated.

Figure 30E is an image depicting the fact that Gemin3, SMN, and
Gemin2 are found in a complex of about 800 kDa, or more, in the cytoplasm. HeLa
cytoplasmic S100 extract was fractionated using a Superose 6 HR 10/30 column. The
fractions were analyzed by SDS-PAGE, and the Gemin3, SMN, and Gemin2 proteins
were detected by Western blotting. The fraction number and the molecular weight
standards were as indicated.

Figure 31A is an image of a gel depicting the fact that Gemin3 interacts
directly with SMN in vitro. In vitro translated [35S]methionine-labeled mycSMN and
myc-Gemin2 proteins were incubated with purified GST-Gemin3 as described
elsewhere herein. Bound proteins were analyzed using SDS-PAGE followed by
fluorography. The in vitro translation panel depicts 2% of the total input.

Figure 31B is an image of a gel depicting that Gemin3 interacts with a
subset of the Sm proteins in vitro. In vitro translated [35S]methionine-labeled mycSm
proteins B, D1, D2, D3, E, F, and G were incubated with purified GST-Gemin3 or
GST-SMN as described elsewhere herein. Bound proteins were analyzed using
SDS-PAGE followed by fluorography. The in vitro translation panel contains 2% of the
input.

Figure 31C is an image of a gel depicting the fact that Gemin3 interacts
directly with SMN and SmB in vitro. Wild-type recombinant 6His-SMN or 6His-SmB
proteins were incubated with purified GST-Gemin3, or with GST alone. The input
lane depicts 10% of 6His-SMN and 6His-SmB. Bound proteins were analyzed by
SDS-PAGE and Western blotting.

Figure 31D is an image of a gel depicting the fact that mutations found
in SMA severely affect SMN interaction with Gemin3. In vitro translated
[35S]methionine-labeled wild-type myc-SMN and the indicated mutant proteins were
incubated with purified GST-Gemin3 as described elsewhere herein. The proteins
bound with GST-Gemin3 were analyzed using SDS-PAGE followed by fluorography.
The in vitro translation panel shows 2% of the input.

Figure 31E is an image of a gel depicting the fact that SMN
oligomerization does not affect the Gemin3 interaction. In vitro translated
[35S]methionine-labeled myc-Gemin3 or myc-SmB proteins were incubated with
purified GST or GST-SMN which had been pre-incubated or not with 6His-SMN
wild-type protein as described elsewhere herein. Bound myc-Gemin3 and myc-SmB
proteins were analyzed using SDS-PAGE followed by fluorography (top portion of the
figure). The in vitro translation panel contains 5% of the input. The position of the
molecular weight markers is indicated on the left (in kDa).

Figure 31F is an image of a Western blot depicting the amount of
6His-SMN which bound to GST or GST-SMN. The input lane depicts 10% of 6His-SMN.
About 5% of the 6His-SMN input is bound to GST-SMN.

Figure 32A is a schematic representation of the myc-Gemin3 wild type
and deletion mutants used in the binding assays. The black boxes represent the seven
helicase domains and the grey box represents the auxiliary C-terminal domain.

Figure 32B is an image of a gel depicting that the interaction of
Gemin3 with SMN is mediated by the non-conserved C-terminal domain of Gemin3.
In vitro translated [35S]methionine-labeled wild-type and mutant myc-Gemin3 proteins
were incubated with purified GST-SMN or GST alone. Bound myc-Gemin3 proteins
were analyzed using SDS-PAGE followed by fluorography. The in vitro translation
panel shows 5% of the input.

Figure 33A is an image depicting a Western blot demonstrating that the
hemagglutinin-chicken SMN (HA-cSMN) fusion protein was expressed in transfected
DT40 cells in the absence of tetracycline but was not expressed in the
presence of 2 micrograms per milliliter of tetracycline.

Figure 33B is an image depicting a Western blot demonstrating the fact
that removal of tetracycline from the culture medium
caused expression of an HA-cSMN fusion protein, which is larger than wild type cSMN
because of the additional amino acid residues comprising the HA tag polypeptide, and
lack of expression of wild type chicken SMN (cSMN). More specifically, in the
presence of 2 micrograms per milliliter tetracycline, only wild type cSMN is detected
by Western blot analysis using anti-chicken SMN antibody. In the absence of
tetracycline (i.e., in lanes designated "0") only the higher molecular weight HA-cSMN
fusion protein was detected and cSMN was not detected.

Figure 34A is an image depicting a Western blot demonstrating that
inhibition of cSMN expression and induction of HA-cSMN expression are
dose-dependent upon the concentration of tetracycline present in the cell culture medium.
That is, decreasing concentration of tetracycline caused increased expression of
HA-cSMN and concomitant decreased expression of cSMN.

Figure 34B is an image depicting a Western blot demonstrating the fact
that expression of HA-cSMN increases upon removal of tetracycline and that
repression of cSMN expression upon removal of tetracycline is permanent. That is,
removal of tetracycline results in continued expression of HA-cSMN and permanent lack
of expression of cSMN (until cell death occurs since HA-cSMN does not provide
necessary SMN function to the cell).

Figure 35A is the amino acid sequence of chicken SMN (SEQ ID NO:10).

Figure 35B is the nucleic acid sequence of chicken SMN (SEQ ID NO:9).

Figure 36A-B is the polypeptide sequence and 36C-F is the nucleic
acid sequence of human Gemin3 (SEQ ID NO:7).

Figure 37 is the nucleic acid sequence of human SIP1 (SEQ ID NO:1;
GenBank Accession No. AF027150).

Figure 38 is the nucleic acid sequence of frog SIP1 (SEQ ID NO:3;
GenBank Accession No. AF0271501).

Figure [10A] 39 is an image of an SDS-PAGE gel depicting the in vitro
mapping of the SMN interaction domains. In vitro translated [35S]methionine-labeled
myc-tagged SMN wild-type and mutant proteins were incubated as indicated with
purified GST or with GST-fusion proteins (SIP1, SMN or SmB) bound to
glutathione-Sepharose beads as described elsewhere herein. Bound proteins were eluted
from the beads by boiling the beads in SDS-PAGE sample buffer and the eluted proteins
were analyzed by SDS-PAGE and fluorography. The in vitro translation lanes contained
10% of the input used for the binding assay.

Figure [10B] 40 is a diagram depicting a schematic representation of
the SMN mutants and summarizing their in vitro binding properties to the GST-fusion
proteins. The first and the last amino acids of the SMN protein fragments are
indicated. The abbreviations are as follows: strong binding (+), weak binding, no
detectable binding (-).

Figure [11A] 41 is an image of a gel depicting the fact that the
dimerization and SmB binding domains of SMN are not identical. In vitro translated
[35S]methionine-labeled SMN wild-type and SMN mutant proteins were fused to the
maltose binding protein (MBP) and were incubated with purified GST (only the
MBP-SMN wild-type fusion), GST-SMN or GST-SmB bound to glutathione-Sepharose
beads as described in the Materials and Methods. Bound proteins were eluted from the
beads by boiling in SDS-PAGE sample buffer and the eluted proteins were analyzed by
SDS-PAGE and fluorography. The in vitro translation lanes contained 10% of the
input used for the binding assay.

Figure [11B] 42 is a schematic representation of the MBP-SMN fusion
proteins and a summary of their in vitro binding properties to GST-SMN and
GST-SmB. The first and the last amino acids of SMN proteins are indicated and the
abbreviations are as follows: strong binding (+) and no detectable binding (-).

Figure [12] 43 is an image of an SDS-PAGE gel depicting that SMN
dimerization increases the binding affinity of SMN for SmB. GST or GST-SMN
bound to glutathione-Sepharose beads were pre-incubated in binding buffer with or
without a four-fold molar excess of purified recombinant His6-tagged-SMN for two
hours at 4°C in a total volume of 1 ml. Unbound His6-tagged-SMN was eliminated by
washing the beads five times with binding buffer. In vitro translated
[35S]methionine-labeled SMN, SmB and SIP1 were then added to the beads. The beads
were washed and the bound proteins were eluted by boiling the beads in SDS-PAGE
sample buffer. The eluted proteins were analyzed by SDS-PAGE followed by
fluorography. The in vitro translation lanes contained 10% of the input used for the
binding assay.

Figure [13A] 44 is an image of a gel depicting the in vitro mapping of the
SIP1 interaction domains. In vitro translated [35S]methionine-labeled myc-tagged SIP1
wild type and mutant proteins were incubated as indicated with purified GST or
GST-fusion proteins (SMN, SIP1 or SmB) bound to glutathione-Sepharose beads. Bound
proteins were eluted from the beads by boiling the beads in SDS-PAGE sample buffer
and the eluted proteins were analyzed by SDS-PAGE and fluorography. The in vitro
translation lanes contained 10% of the input used for the binding assay.

Figure [13B] 45 is a schematic representation of the SIP1 mutants and a
summary of their in vitro binding properties to GST-SMN. The first and the last amino
acids of SIP1 proteins are indicated. The abbreviations are as follows: strong binding
(+), no detectable binding (-).

Figure [14A] 46A is an image of an SDS-PAGE gel depicting the in vivo
analysis of the SMN complex by co-immunoprecipitation experiments. 293T cells
were transiently transfected with the indicated myc-tagged DNA constructs. 48 hours
post-transfection, the cells were collected and the total extracts were prepared as
described elsewhere herein. Extracts from cells transfected with myc-SIP1 wild-type
(wt) or with deletion mutant (ΔC89) were immunoprecipitated with the anti-SMN
antibody (2B1). Total and immunoprecipitated fractions were analyzed by Western
blot with 2B1 and anti-myc (9E10) antibodies.

Figure [14B] 46B is an image of a gel depicting the proteins in extracts
from cells transfected with myc-SMN wild-type (wt) or with deletion mutant (ΔN92)
which were immunoprecipitated with anti-SIP1 (2E17) antibody. Total and
immunoprecipitated fractions were analyzed by Western blot with anti-myc 9E10.

Figure [14C] 47 is an image of a gel depicting the proteins in extracts from
cells transfected with myc-SMN wild-type (wt) or deletion mutant (ΔC26) which were
immunoprecipitated with both anti-Sm (Y12) and anti-SIP1 (2E17) antibodies. Total
and Y12 immunoprecipitations were analyzed by Western blot with anti-myc 9E10 and
anti-Sm Y12. Immunoprecipitations with anti-SIP1 (2E17) were analyzed by Western
blot with 9E10 and 2E17. The heavy and light chains of the immunoglobulins are
indicated (h.c. and l.c., respectively).



Figure [15A] 48 depicts a schematic representation of SMN and SIP1
interaction domains and depicts a model of the SMN/SIP1 complex. The diagram
depicts a summary of the regions of SMN and SIP1 required for their interactions
within the complex (SMN exons are indicated). The SMN domain required for SmB
binding contains the highly conserved YG box described by Talbot et al. (1997, Hum.
Mol. Genet. 6:497-500).

Figure [15B] 49 is a diagram illustrating that, according to the in vivo and
in vitro analysis disclosed herein, the SMN/SIP1 complex is at least a tetramer formed
by two directly interacting SMN molecules, each of which is also bound to a SIP1
molecule. SIP1 interacts with SMN directly but not with itself. Dimeric SMN has a
much higher binding affinity than monomeric SMN for the substrate, namely the Sm
proteins.

Figure [16] 50 is an image (comprising 8 panels) of photomicrographs
depicting the subcellular localization of SMN deletion mutants using confocal
immunofluorescence microscopy. HeLa cells were transiently transfected with SMN
constructs expressing myc-tagged SMN wild-type or with the indicated myc-tagged
SMN deletion mutants (see Figure [10B] 40). Indirect immunofluorescence was
performed using monoclonal antibody 9E10 against the myc-tag protein. Gems
staining was detected by anti-myc antibodies and the staining completely co-localizes
with immunostaining of endogenous SIP1. The intensity of the cytoplasmic staining is
underestimated by approximately two-fold in the sections shown.

Figure [17] 51 is an image (comprising four panels) of photomicrographs
depicting the subcellular localization of SIP1 deletion mutants using confocal
immunofluorescence microscopy. HeLa cells were transiently transfected with SIP1
constructs expressing myc-tagged SIP1 wild-type or with the indicated myc-tagged
SIP1 deletion mutants (see Figure [13B] 45). Indirect immunofluorescence was
performed using monoclonal antibody 9E10 against the myc-tag protein. Gems
staining was detected by anti-myc antibodies which completely co-localize with
antibodies to endogenous SMN. The intensity of the cytoplasmic staining is
underestimated by approximately two-fold in the sections shown.

Figure [18] 52 is an image (comprising four panels) depicting the targeting
of pyruvate kinase (PK) to gems. The image depicts the results of confocal
immunofluorescence microscopy demonstrating HeLa cells which were transiently
transfected with myc-PK constructs expressing myc-tagged pyruvate kinase fused to
either SMN wild-type or the indicated SMN deletion mutants (see Figure [10B] 40).
Indirect immunofluorescence was performed using monoclonal antibody 9E10 against
the myc-tag protein. Gems are indicated by arrows. Gems staining detected by
anti-myc antibodies completely co-localizes with endogenous SIP1.

DETAILED DESCRIPTION OF THE INVENTION

The invention is based on the discovery of a novel gene (SEQ ID NO:1)
and a protein encoded thereby (SEQ ID NO:2), designated the Survival of Motor
Neuron-Interacting Protein 1 (SIP1), so called because the protein is tightly associated
with the SMA disease gene product, SMN. Further, the protein has been provisionally
designated Gemin2 since it is the second component of gems to be identified to date,
SMN being the first. In addition, the nucleic and amino acid sequences of frog and
yeast homologs of human SIP1 are also disclosed herein.

The invention further relates to the discovery that SMN and SIP1 form a
large, approximately 300 kDa, more preferably, an 800 kDa, complex in vivo which
includes other SMN-associated proteins such as the Sm proteins. The data provided
herein establish that these proteins (e.g., SMN and SIP1) play an important role in
mRNA biogenesis and that SMA is the first human disease identified involving a
defect in mRNA metabolism. Indeed, the data disclosed herein demonstrate that the
SMN proteins of SMA patients bind with themselves, with SIP1, or both, with less
affinity than does wild type SMN. These results further indicate a causal connection
between decreased association between SMN and SIP1 and SMA disease processes.
Additionally, the invention relates to the discovery of a novel gene (SEQ ID NO:7) and
protein encoded thereby (SEQ ID NO:8) which is the third protein component of gems
identified to date and has therefore been designated Gemin3. Further, the invention
relates to the discovery that Gemin3 also binds with SMN such that Gemin3 is
associated with the afore-mentioned complex involved in mRNA biogenesis.

The invention also relates to a novel cell line which lacks endogenous
SMN but which comprises an exogenous nucleic acid encoding chicken SMN (a novel
homolog of human SMN [SEQ ID NO:9] which is disclosed elsewhere herein) under
the control of an inducible promoter/regulatory sequence. This cell line is an important
tool for the study of SMA disease mechanism(s) and for the identification of novel
therapeutics useful for treatment of this deadly hereditary disease for which there is no
known cure.

Nucleic acids and proteins

The present invention, therefore, provides a novel isolated nucleic acid
(SEQ ID NO:1) encoding the human SIP1 protein (huSIP1 [SEQ ID NO:2]). In
addition, the invention provides an isolated nucleic acid (SEQ ID NO:3) encoding the
Xenopus laevis SIP1 protein (XeSIP1 [SEQ ID NO:4]).

The invention includes isolated nucleic acids encoding a eukaryotic 
SIP1 which is, preferably, at least about 20% homologous to at least one of huSIP1 
(SEQ ID NO:1) and XeSIP1 (SEQ ID NO:3). More preferably, the isolated nucleic 
acid encoding a eukaryotic SIP1 is at least about 25%, preferably, at least about 35%, 
more preferably, at least about 45%, even more preferably, at least about 55%, more 
preferably, at least about 65%, even more preferably, at least about 75%, yet more 
preferably, at least about 85% homologous, more preferably, at least about 95% and 
even more preferably, at least about 99% homologous to at least one of huSIP1 (SEQ 
ID NO:1) and XeSIP1 (SEQ ID NO:3). More preferably, the isolated nucleic acid 
encoding a eukaryotic SIP1 is human huSIP1 or frog XeSIP1. Most preferably, the 
isolated nucleic acid encoding a eukaryotic SIP1 is SEQ ID NO:1 or SEQ ID NO:3. 
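
By way of illustration only, the graded homology thresholds recited above can be 
checked computationally. The following is a minimal sketch that assumes the two 
sequences have already been aligned to equal length (gaps written as "-") and that 
percent homology is taken as the fraction of aligned positions carrying identical 
residues. The function names, the toy sequences, and the treatment of each "at least 
about" value as an exact cutoff are assumptions made for this example and are not 
part of the disclosure. 

```python
# Thresholds recited in the text (percent homology), treated here as exact cutoffs.
TIERS = (20, 25, 35, 45, 55, 65, 75, 85, 95, 99)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned positions at which both sequences carry the same residue.

    Both inputs must already be aligned to the same length; '-' marks a gap.
    Columns that are gaps in both sequences are ignored; a gap opposite a
    residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def highest_tier_met(pct: float):
    """Return the highest recited threshold that pct satisfies, or None."""
    met = [tier for tier in TIERS if pct >= tier]
    return max(met) if met else None


if __name__ == "__main__":
    # Toy sequences for illustration; these are not SEQ ID NO:1 or SEQ ID NO:3.
    pct = percent_identity("ATGGCCAAGT", "ATGGCTAAGA")
    print(pct, highest_tier_met(pct))
```

In practice the alignment itself would be produced by a dedicated alignment program 
before any such comparison is made. 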

Thus, the invention also includes an isolated nucleic acid encoding a 
eukaryotic SIP1 where the nucleic acid encodes a protein which is, preferably, 
at least about 20% homologous to the amino acid sequence of at least one of human 
SIP1 (SEQ ID NO:2) and frog SIP1 (SEQ ID NO:4). More preferably, the isolated 
nucleic acid encodes a eukaryotic SIP1 which is at least about 25%, more preferably, 
at least about 35%, even more preferably, at least about 45%, more preferably, at least 
about 55%, yet more preferably, at least about 65%, even more preferably, at least 
about 75%, yet more preferably, at least about 85% homologous, and more preferably, 
at least about 95%, and even more preferably, at least about 99% homologous to at 
least one of human SIP1 (SEQ ID NO:2) and frog SIP1 (SEQ ID NO:4). More 
preferably, the isolated nucleic acid encodes a eukaryotic SIP1 that is human SIP1 or 
frog SIP1. Most preferably, the isolated nucleic acid encodes a eukaryotic SIP1 having 
the amino acid sequence SEQ ID NO:2 or SEQ ID NO:4. 

Thus, the invention should be construed to include nucleic acids which 
encode human SIP1 (SEQ ID NO:2) and frog SIP1 (SEQ ID NO:4). 

The invention also includes a nucleic acid encoding a mammalian 
Gemin3 protein. Preferably, the nucleic acid encoding a mammalian Gemin3 is at least 
about 20% homologous to human Gemin3 (SEQ ID NO:7). More preferably, the 
isolated nucleic acid encoding a mammalian Gemin3 is at least about 25%, preferably, 
at least about 35%, more preferably, at least about 45%, even more preferably, at least 
about 55%, more preferably, at least about 65%, even more preferably, at least about 
75%, yet more preferably, at least about 85% homologous, more preferably, at least 
about 95% and even more preferably, at least about 99% homologous to SEQ ID 
NO:7. More preferably, the isolated nucleic acid encoding a mammalian Gemin3 is 
human Gemin3. Most preferably, the isolated nucleic acid encoding a mammalian 
Gemin3 is SEQ ID NO:7. 

In another aspect, the present invention includes an isolated nucleic acid 
encoding a mammalian Gemin3, or a fragment thereof, wherein the protein encoded by 
the nucleic acid shares at least about 20% homology with the amino acid sequence of 
SEQ ID NO:8. Preferably, the nucleic acid encodes a protein that is about 35% 
homologous, more preferably 65% homologous, even more preferably 95% 
homologous, and most preferably about 99% homologous to the human Gemin3 
disclosed herein (SEQ ID NO:8). Even more preferably, the Gemin3 protein encoded 
by the nucleic acid is SEQ ID NO:8. 






The present invention includes a nucleic acid encoding chicken SMN, or 
a fragment thereof, wherein the nucleic acid shares at least about 20% homology with 
chicken SMN (SEQ ID NO:9). Preferably, the nucleic acid is about 35% homologous, 
more preferably about 65% homologous, even more preferably 95% homologous, and 
most preferably about 99% homologous to the chicken SMN disclosed herein (SEQ ID 
NO:9). Even more preferably, the nucleic acid is SEQ ID NO:9. 

In another aspect, the present invention includes an isolated nucleic acid 
encoding chicken SMN, or a fragment thereof, wherein the protein encoded by the 
nucleic acid shares at least about 80% homology with the amino acid sequence of SEQ 
ID NO:10. Preferably, the nucleic acid encodes a protein that is about 85% 
homologous, more preferably 85% homologous, even more preferably 95% 
homologous, and most preferably about 99% homologous to the chicken SMN 
disclosed herein (SEQ ID NO:10). Even more preferably, the chicken SMN protein 
encoded by the nucleic acid is SEQ ID NO:10. 

Thus, the invention should be construed to include nucleic acids which 
encode chicken SMN (SEQ ID NO:10). 

The isolated nucleic acid of the invention should be construed to include 
an RNA or a DNA sequence encoding an SIP1 protein of the invention, and any 
modified forms thereof, including chemical modifications of the DNA or RNA which 
render the nucleotide sequence more stable when it is cell free or when it is associated 
with a cell. Similarly, the invention includes such modified forms of Gemin3 and 
chicken SMN. Chemical modifications of nucleotides may also be used to enhance the 
efficiency with which a nucleotide sequence is taken up by a cell or the efficiency with 
which it is expressed in a cell. Any and all combinations of modifications of the 
nucleotide sequences are contemplated in the present invention. 

The present invention should not be construed as being limited solely to 
the nucleic and amino acid sequences disclosed herein. Once armed with the present 
invention, it is readily apparent to one skilled in the art that other nucleic acids 
encoding SIP1, SMN, and Gemin3 proteins may be obtained by following the 
procedures described herein in the experimental details section for the isolation of the 
human, frog and yeast nucleic acids encoding SIP1 proteins, as well as those 
procedures used to identify and isolate human Gemin3 and chicken SMN, and the 
respective polypeptides encoded thereby (i.e., Gemin3 and chicken SMN), as disclosed 
elsewhere herein. 

Further, the invention should be construed to include naturally occurring 
variants or recombinantly derived mutants of wild type SIP1 DNA sequences, which 
variants or mutants render the protein encoded thereby either more, less, or just as 
biologically active as the full-length wild type SIP1 of the invention. Any number of 
procedures may be used for the generation of mutant, derivative or variant forms of 
SIP1, Gemin3, and SMN, using recombinant DNA methodology well known in the art 
such as, for example, that described in Sambrook et al. (1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, New York), Ausubel et al. 
(1997, Current Protocols in Molecular Biology, Green & Wiley, New York), and 
elsewhere herein. 

As will be apparent from the experiments described herein, SIP1 
comprises a binding domain which mediates SIP1 binding to SMN. The invention 
includes a mutant of SIP1, wherein a mutation is introduced into the sequence of SIP1 
cDNA such that the protein encoded thereby does not bind SMN or binds to a lesser 
extent. Such mutant is useful in the methods of the invention and for the study of the 
role of SMN-SIP1 interaction in mRNA biogenesis. Preferably, a mutant SIP1 gene 
which encodes an SIP1 protein comprising a deletion whereby amino acids from the 
carboxyl terminal of the SIP1 protein have been removed is useful in studying the 
association of SIP1 with SMN. Examples of such mutants (SIP1ΔC89 and 
SIP1ΔC162) have been disclosed herein. However, the invention is not limited solely 
to these mutants; rather, the invention encompasses other mutants, comprising deletion 
and point mutations, which demonstrate altered binding to SMN or other SMN-SIP1 
complex associated proteins (e.g., Gemin3 and various Sm proteins) and ribonucleic 
acids. These mutants allow the functional mapping of various portions of the protein to 
determine which portion(s) of the protein mediate which function(s) of the full-length 
protein. 






Likewise, the present invention includes an isolated nucleic acid 
encoding SMN and mutants, homologs and variants thereof which exhibit altered 
binding with another SMN molecule and/or decreased binding with the SMN- 
associated protein SIP1. These proteins are important tools to determine which 
portion(s) of the proteins mediate the biological effect(s) associated with the full-length 
protein. Further, they are useful in elucidating the mechanisms of SMA and in the 
identification of therapeutics for treatment of this disease. 

The invention also includes selected mutants, homologs, and variants of 
Gemin3. Several of the proteins exhibited unchanged binding with SMN despite 
deletions of selected portions of the amino acid sequence (e.g., ΔN368C272Gemin3) 
while other proteins demonstrated lack of binding with SMN (e.g., ΔC328Gemin3 and 
ΔN548Gemin3). Such proteins are important tools in elucidating the effect of 
naturally-occurring and/or artificially produced amino acid sequences in cell processes 
and, more specifically, for assessing the function of various proteins, and fragments 
thereof, in SMA. 

The invention also includes a nucleic acid encoding SMN or SIP1, and 
mutants, derivatives, variants, and fragments thereof, that may retain biological 
activity. Such variants, i.e., analogs of SMN or SIP1, include proteins or polypeptides 
which have been or may be modified using recombinant DNA technology such that the 
protein or polypeptide possesses additional properties which enhance its suitability for 
use in the methods described herein, for example, but not limited to, variants 
conferring enhanced stability on the SMN-SIP1 complex, enhanced specific binding of 
SIP1 to SMN, and the like. 

Procedures for the introduction of amino acid changes in a protein or 
polypeptide by altering the DNA sequence encoding the polypeptide are well known in 
the art and are also described in Sambrook et al. (1989, supra) and Ausubel et al. (1997, 
supra). 

The invention includes a nucleic acid encoding a mammalian SMN, 
SIP1, or Gemin3 wherein a nucleic acid encoding a tag polypeptide is covalently 
linked thereto. That is, the invention encompasses a chimeric nucleic acid wherein a 
nucleic acid sequence encoding a tag polypeptide is covalently linked to a nucleic acid 
encoding at least one of human SIP1, frog SIP1, yeast Brr1, human Gemin3, and 
chicken SMN. Such chimeric (i.e., fusion) tag polypeptides are well known in the art 
and include, for instance, myc, myc-pyruvate kinase (myc-PK), His6, maltose binding 
protein (MBP), and glutathione-S-transferase (GST). However, the invention should in 
no way be construed to be limited to the nucleic acids encoding the above-listed tag 
polypeptides. Rather, any nucleic acid sequence encoding a polypeptide which may 
function in a manner substantially similar to these tag polypeptides should be construed 
to be included in the present invention. 

A nucleic acid encoding a protein of interest (e.g., SMN, SIP1, Gemin3, 
and any mutant, derivative, variant, or fragment thereof) comprising a nucleic acid 
encoding a tag polypeptide and a fusion protein produced therefrom can be used to 
localize SMN, SIP1, and Gemin3 within a cell and to study expression, localization, 
and role(s) of the tagged protein in a cell before, during, and/or after exposing the cell 
to a test compound potentially useful for treatment of SMA. Further, addition of a tag 
to a polypeptide of interest (e.g., SMN, SIP1, Gemin3) facilitates isolation and 
purification of the "tagged" protein such that the protein of interest can be easily 
produced and purified. 

Additionally, variants may be chemically linked to another protein. For 
example, SMN was conjugated with bovine serum albumin (BSA) as described in 
Fischer et al. (1995, Cell 82:475-483). However, the invention is not limited to 
chemically linking SMN with BSA, but also includes cross-linking SMN, SIP1, and 
Gemin3 proteins by chemical means to any suitable composition. Thus, it should be 
appreciated that the use of BSA is described herein by way of example only. 

In certain situations, it may be desirable to inhibit expression of SMN, SIP1, or 
Gemin3 in a cell that would otherwise express the protein. Therefore, the invention 
includes compositions useful for inhibition of expression of SMN, SIP1, and Gemin3. 
One such method of the invention features an isolated nucleic acid complementary to a 
portion or all of a nucleic acid encoding an SMN, SIP1, or Gemin3 which is in an 
antisense orientation with respect to transcription. Preferably, the antisense nucleic 
acid useful for inhibition of SIP1 expression shares at least 20% homology with at least 
one of SEQ ID NO:1, SEQ ID NO:3, and SEQ ID NO:5. 

Similarly, an antisense nucleic acid useful for inhibition of Gemin3 
expression shares at least about 20% homology with SEQ ID NO:7. 

Likewise, an antisense nucleic acid useful for inhibition of chicken SMN 
expression shares at least about 20% homology with SEQ ID NO:9. 

The above-referenced antisense nucleic acids serve to inhibit the 
expression, function, or both, of human SIP1, frog SIP1, human Gemin3, and chicken 
SMN. 
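
For illustration, the relationship between a coding (sense) strand and a nucleic acid 
in the antisense orientation can be sketched as a reverse-complement operation. The 
sketch below is hypothetical: the function name and the toy sequence are assumptions 
made for this example, only the four unambiguous DNA bases are handled, and it does 
not represent the procedure used to prepare any antisense construct described herein. 

```python
# Watson-Crick pairing for the four unambiguous DNA bases.
_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(sense_dna: str) -> str:
    """Return the strand complementary to sense_dna, written 5' to 3'.

    Raises ValueError on any character other than A, C, G, or T.
    """
    sense_dna = sense_dna.upper()
    unknown = set(sense_dna) - set(_COMPLEMENT)
    if unknown:
        raise ValueError(f"unsupported bases: {sorted(unknown)}")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return "".join(_COMPLEMENT[base] for base in reversed(sense_dna))


if __name__ == "__main__":
    # Toy fragment for illustration; this is not taken from any SEQ ID NO.
    sense = "ATGGCC"
    antisense = reverse_complement(sense)
    print(sense, antisense)
```

Applying the function twice returns the original strand, which is a convenient sanity 
check on the pairing table. 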

The invention also includes an isolated polypeptide comprising a 
eukaryotic SIP1. Preferably, the isolated polypeptide comprising a eukaryotic SIP1 is 
at least about 20% homologous to at least one of human SIP1 (SEQ ID NO:2) and frog 
SIP1 (SEQ ID NO:4). More preferably, the isolated polypeptide comprising a 
eukaryotic SIP1 is at least about 35%, more preferably, at least about 65%, even more 
preferably, at least about 95%, and more preferably, at least about 99% homologous to 
at least one of human SIP1 and frog SIP1. More preferably, the isolated polypeptide 
comprising a eukaryotic SIP1 is at least one of human SIP1 and frog SIP1. Most 
preferably, the isolated polypeptide comprising a eukaryotic SIP1 is at 
least one of SEQ ID NO:2 and SEQ ID NO:4. 

The invention also includes an isolated polypeptide comprising a 
mammalian Gemin3. Preferably, the isolated polypeptide comprising a mammalian 
Gemin3 is at least about 20% homologous to human Gemin3 (SEQ ID NO:8). More 
preferably, the isolated polypeptide comprising a mammalian Gemin3 is at least about 
35%, more preferably, at least about 65%, even more preferably, at least about 95%, 
and more preferably, at least about 99% homologous to human Gemin3. More 
preferably, the isolated polypeptide comprising a mammalian Gemin3 is human 
Gemin3. Most preferably, the isolated polypeptide comprising a mammalian Gemin3 
is SEQ ID NO:8. 

The invention also includes an isolated polypeptide comprising a 
chicken SMN. Preferably, the isolated polypeptide comprising a chicken SMN is at 
least about 20% homologous to SEQ ID NO:10. More preferably, the isolated 
polypeptide comprising a chicken SMN is at least about 35%, more preferably, at least 
about 65%, even more preferably, at least about 95%, and more preferably, at least 
about 99% homologous to chicken SMN. More preferably, the isolated polypeptide 
is chicken SMN. Most preferably, the isolated polypeptide comprising a 
chicken SMN is SEQ ID NO:10. 

The present invention also provides for analogs of proteins or peptides 
which comprise a eukaryotic SIP1, a mammalian Gemin3, and chicken SMN protein as 
disclosed herein. Analogs may differ from naturally occurring proteins or peptides by 
conservative amino acid sequence differences or by modifications which do not affect 
sequence, or by both. For example, conservative amino acid changes may be made, 
which although they alter the primary sequence of the protein or peptide, do not 
normally alter its function. Conservative amino acid substitutions typically include 
substitutions within the following groups: 

glycine, alanine; 

valine, isoleucine, leucine; 

aspartic acid, glutamic acid; 

asparagine, glutamine; 

serine, threonine; 

lysine, arginine; 

phenylalanine, tyrosine. 
Modifications (which do not normally alter primary sequence) include in vivo, or in 
vitro, chemical derivatization of polypeptides, e.g., acetylation, or carboxylation. Also 
included are modifications of glycosylation, e.g., those made by modifying the 
glycosylation patterns of a polypeptide during its synthesis and processing or in further 
processing steps; e.g., by exposing the polypeptide to enzymes which affect 
glycosylation, e.g., mammalian glycosylating or deglycosylating enzymes. Also 
embraced are sequences which have phosphorylated amino acid residues, e.g., 
phosphotyrosine, phosphoserine, or phosphothreonine. 
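
The conservative substitution groups listed above lend themselves to a simple 
lookup. The following sketch, offered by way of example only, encodes those seven 
groups using one-letter amino acid codes and reports whether each difference between 
two equal-length peptide sequences falls within a single group. The function names 
and the toy peptides are assumptions made for this example; residues outside the 
listed groups are simply treated as non-conservative when changed. 

```python
# The seven groups listed in the text, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = (
    frozenset("GA"),   # glycine, alanine
    frozenset("VIL"),  # valine, isoleucine, leucine
    frozenset("DE"),   # aspartic acid, glutamic acid
    frozenset("NQ"),   # asparagine, glutamine
    frozenset("ST"),   # serine, threonine
    frozenset("KR"),   # lysine, arginine
    frozenset("FY"),   # phenylalanine, tyrosine
)


def is_conservative(ref: str, alt: str) -> bool:
    """True if ref and alt are different residues within one listed group."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt:
        return False  # no substitution at all
    return any(ref in group and alt in group for group in CONSERVATIVE_GROUPS)


def classify_substitutions(reference: str, variant: str):
    """List (position, ref, alt, conservative?) for each differing residue.

    Positions are 1-based. Both peptides must have the same length, since this
    sketch does not handle insertions or deletions.
    """
    if len(reference) != len(variant):
        raise ValueError("peptides must be the same length")
    return [
        (i, r, v, is_conservative(r, v))
        for i, (r, v) in enumerate(zip(reference.upper(), variant.upper()), start=1)
        if r != v
    ]


if __name__ == "__main__":
    # Toy peptides for illustration; not taken from any SEQ ID NO.
    for change in classify_substitutions("MKVLD", "MRVLA"):
        print(change)
```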






Also included are polypeptides which have been modified using 
ordinary molecular biological techniques so as to improve their resistance to 
proteolytic degradation or to optimize solubility properties or to render them more 
suitable as a therapeutic agent. Analogs of such polypeptides include those containing 
residues other than naturally occurring L-amino acids, e.g., D-amino acids or non- 
naturally occurring synthetic amino acids. The peptides of the invention are not 
limited to products of any of the specific exemplary processes listed herein. 

The present invention should also be construed to encompass "mutants," 
"derivatives," and "variants" of the peptides of the invention (or of the DNA encoding 
the same) which mutants, derivatives and variants are SMN, SIP1, and Gemin3 
peptides which are altered in one or more amino acids (or, when referring to the 
nucleotide sequence encoding the same, are altered in one or more base pairs) such that 
the resulting peptide (or DNA) is not identical to the sequences recited herein, but has 
the same biological property as the peptides disclosed herein, in that the peptide has 
biological/biochemical properties of the SMN, SIP1, or Gemin3 peptide of the present 
invention. A biological property of a SMN protein should be construed to include, but 
not be limited to, the ability of the peptide to bind specifically and avidly with another 
SMN molecule and also to bind with SIP1, Gemin3, and various Sm proteins. 

Likewise, a biological property of the Gemin3 peptide of the invention 
is to bind specifically with SMN. 

A biological activity of the SIP1 of the invention is the ability to bind 
specifically with SMN. 

Similarly, a biological function of Gemin3 is the ability to specifically 
bind with SMN. In addition, a biological property of Gemin3 is the ability to function 
as a helicase as indicated by the presence of a DEAD motif conserved in RNA 
helicases as reviewed in Staley and Guthrie (1998, Cell 92:315-326). 
Further, a biological activity of SMN, SIP1, and Gemin3 is to be able to participate in 
mRNA biosynthesis. 

Further, the invention should be construed to include naturally occurring 
variants or recombinantly derived mutants of SIP1 and Gemin3 sequences, which 
variants or mutants render the protein encoded thereby either more, less, or just as 
biologically active as the full-length proteins of the invention. 

The nucleic acids, and peptides encoded thereby, are useful tools for 
elucidating the function(s) of SMN, SIP1 and Gemin3 in a cell. Further, they are 
useful for localizing the nucleic acid, protein, or both, in a cell and for assessing the 
level of expression of the nucleic acid and/or protein under selected conditions 
including in response to therapeutic treatment. Further, nucleic and amino acids 
comprising eukaryotic SIP1 and mammalian Gemin3 are useful diagnostics which can 
be used, for example, to identify a compound that affects expression of the protein and 
is a candidate SMA therapeutic. 

In addition, the nucleic acids, the proteins encoded thereby, or both, can 
be administered to a mammal to increase or decrease expression of SIP1 or Gemin3 in 
the mammal. This can be therapeutic to the mammal if under or over-expression of 
SIP1 or Gemin3 in the mammal mediates a disease or condition associated with altered 
expression of the protein compared with normal expression of SIP1 and/or Gemin3 in a 
healthy mammal. Additionally, the nucleic and amino acids of the invention can be 
used to produce recombinant cells which are useful tools for the study of SMA, the 
identification of novel SMA therapeutics, and for elucidating the cellular role(s) of 
SIP1 and Gemin3 in mRNA biosynthesis and other cell processes, among other things. 
Further, the nucleic and amino acids of the invention can be used diagnostically, either 
by assessing the level of gene expression or protein expression and the biological 
activity of the protein, to assess severity and prognosis of SMA. 

In other related aspects, the invention includes a nucleic acid encoding a 
eukaryotic SIP1 operably linked to a nucleic acid comprising a promoter/regulatory 
sequence such that the nucleic acid is preferably capable of directing expression of the 
protein encoded by the nucleic acid. 

Expression of SIP1 either alone or fused to a detectable tag polypeptide 
in cells which either do not normally express SIP1 or which do not express SIP1 
comprising a tag polypeptide, can be accomplished by operably linking the nucleic acid 
encoding SIP1 to a promoter/regulatory sequence which serves to drive expression of 
the protein, with or without a tag polypeptide, in cells in which the exogenous nucleic 
acid (i.e., transgene) is introduced. 

Many promoter/regulatory sequences useful for driving constitutive 
expression of a gene are available in the art and include, but are not limited to, for 
example, the cytomegalovirus immediate early promoter enhancer sequence, the SV40 
early promoter, both of which were used in the experiments disclosed herein, as well as 
the Rous sarcoma virus promoter, and the like. Moreover, inducible and tissue specific 
expression of the nucleic acid encoding SIP1 may be accomplished by placing the 
nucleic acid encoding SIP1, with or without a tag, under the control of an inducible or 
tissue specific promoter/regulatory sequence. Examples of tissue specific or inducible 
promoter/regulatory sequences which are useful for this purpose include, but are not 
limited to, the MMTV LTR inducible promoter, and the SV40 late enhancer/promoter. 
In addition, promoters which are well known in the art which are induced in response 
to inducing agents such as metals, glucocorticoids, and the like, are also contemplated 
in the invention. Thus, it will be appreciated that the invention includes the use of any 
promoter/regulatory sequence, which is either known or unknown, and which is 
capable of driving expression of the desired protein encoded by a nucleic acid operably 
linked to the promoter/regulatory sequence. 

Similarly, the invention includes a nucleic acid encoding a mammalian 
Gemin3 operably linked to a nucleic acid comprising a promoter/regulatory sequence 
such that the nucleic acid is preferably capable of directing expression of the protein 
encoded by the nucleic acid. Further, the present invention encompasses a nucleic acid 
encoding a chicken SMN operably linked to a nucleic acid comprising a 
promoter/regulatory sequence. 

Expressing SIP1, Gemin3, or chicken SMN using a promoter/regulatory 
sequence allows the isolation of large amounts of recombinantly produced protein. 
Further, where the lack or decreased level of SIP1, Gemin3, or SMN expression causes 
a disease or condition associated with such expression, the expression of the protein 
driven by a promoter/regulatory sequence can provide useful therapeutics including, 
but not limited to, gene therapy whereby the protein is provided. 






Vectors and recombinant cells 

The invention also includes a vector comprising a nucleic acid encoding 
a eukaryotic SIP1, a mammalian Gemin3, or a chicken SMN. The incorporation of a 
desired nucleic acid into a vector and the choice of vectors is well-known in the art as 
described in, for example, Sambrook et al., supra, and Ausubel et al., supra. 
Further, the invention encompasses expression vectors and methods for the 
introduction of exogenous nucleic acid encoding SMN, SIP1, and Gemin3 into a cell 
with concomitant expression of the exogenous nucleic acid in the cell using such 
methods as those described in, for example, Sambrook et al. (1989, supra), and 
Ausubel et al. (1997, supra), and as disclosed elsewhere herein. 

Expression of chicken SMN from a plasmid under the control of an inducible 
promoter/regulatory sequence allows the selective expression of SMN in a cell lacking 
endogenous SMN expression. As more fully set forth below, this cell provides a stable 
genetic system for the study of the role(s) of SMN in SMA and for the identification of 
SMA therapeutics to treat the effects of the lack of SMN in the cell. 

Selection of any particular plasmid vector or other DNA vector is not a 
limiting factor in this invention and a wide plethora of vectors is well-known in the art. 
Further, it is well within the skill of the artisan to choose particular promoter/regulatory 
sequences and operably link those promoter/regulatory sequences to a DNA sequence 
encoding a desired polypeptide. Such technology is well known in the art and is 
described, for example, in Sambrook, supra, and Ausubel, supra. 

The invention also includes cells, viruses, proviruses, and the like, 
containing such vectors. Methods for producing cells comprising vectors and/or 
exogenous nucleic acids are well-known in the art. See, e.g., Sambrook et al., supra; 
Ausubel et al., supra. 

The nucleic acids encoding SIP1, Gemin3, or chicken SMN can be 
cloned into various plasmid vectors. However, the present invention should not be 
construed to be limited to plasmids or to any particular vector. Instead, the present 
invention should be construed to encompass a wide plethora of vectors which are 
readily available and/or well-known in the art. 






The invention also includes the expression of human SIP1 in non- 
human cells where it is not normally expressed or expression of SIP1-tagged fusion 
protein in cells where this fusion protein is not normally expressed. In a preferred 
embodiment, human SIP1, human SMN, and Gemin3 nucleic acid tagged with a 
nucleic acid expressing a tag polypeptide was used to transiently transfect a 
mammalian cell. Plasmid constructs containing SMN, SIP1, or Gemin3, or mutants, 
variants, derivatives and fragments thereof, were cloned into several vectors 
comprising a nucleic acid encoding a tag polypeptide (see, e.g., Nakielny and Dreyfuss, 
1996, J. Cell. Biol. 134:1365-1373). The plasmids were introduced into the cell using 
standard methods (e.g., calcium phosphate, electroporation, and the like). 

In another embodiment, human SIP1 or SMN was expressed in Xenopus laevis 
oocytes where the exogenous nucleic acid encoding the mammalian proteins was 
introduced into the oocytes by standard microinjection techniques. The present 
invention also encompasses expression of these exogenous nucleic acids in amphibian 
and other non-mammalian cells (e.g., yeast, insect, and avian cells) using methods well- 
known in the art such as those disclosed elsewhere herein. Thus, it is clear that the 
invention is not limited to any particular vector or to any particular method of 
introducing the exogenous nucleic acid encoding at least one of SMN, SIP1, and 
Gemin3 into a cell. 

Expression of proteins of interest (e.g., SMN, SIP1, and Gemin3) in a 
cell, especially when the protein comprises a tag polypeptide, allows localization of the 
nucleic acid and/or the protein expressed therefrom within the cell under selected 
conditions such that the function(s) of the protein in the cell can be studied and 
identified. 

One skilled in the art would appreciate, based upon the disclosure 
provided herein, that the invention also includes expression of SIP1, SMN, Gemin3, 
and the like, in prokaryotic cells (e.g., bacterial cells such as, for example, E. coli). 
Accordingly, the invention includes expression of the proteins of the invention in such 
cells as well. 






In another preferred embodiment, SMN and SIP1 nucleic acids which 
expressed a tag polypeptide were also expressed in an in vitro transcription-translation 
system as well as in various E. coli strains to produce SMN or SIP1 protein variants or 
fragments fused to a tag polypeptide. Thus, the invention encompasses the expression 
of SMN, SIP1, and Gemin3 in a cell free system in addition to the expression of such 
proteins in a cell. 

However, the invention should not be construed as being limited to 
these plasmid vectors, bacterial strains, or to these tag polypeptides. Further, the 
invention is not limited to calcium phosphate transfection or to HeLa cells as 
exemplified herein. Instead, the invention encompasses other expression vectors and 
methods for the introduction of exogenous DNA into cells with concomitant expression 
of the exogenous DNA in the cells such as those described, for example, in Sambrook 
et al. (1989, supra), and Ausubel et al. (1997, supra). 

The invention also includes a cell comprising an isolated nucleic acid 
encoding at least one of an SIP1 and a Gemin3, or a vector comprising the same. 

The invention relates to a cell line as an in vitro model of SMA for the study of defects 
in SMN and its associated proteins in mRNA biogenesis and for the screening of 
compounds which affect SMN activity. The cell line of the invention is one which 
exhibits reduced expression of SMN when compared with normal levels of expression 
of SMN in cells or, alternatively, exhibits increased or altered expression of SMN 
when compared to the level of SMN expression in an otherwise identical cell which 
does not comprise the exogenous nucleic acid encoding SMN. 
Cell lines can be created which exhibit enhanced expression of SMN; however, the 
preferred cell line of the invention is one in which the expression of SMN is reduced 
when compared with normal expression of SMN. The cell line of the invention should 
not be construed to be limited by the manner in which the expression of SMN is 
reduced when compared with normal expression of SMN, in that there are any number 
of ways to reduce expression of SMN in a cell. 

In one embodiment, the cell line is a mammalian cell comprising an 
expression vector comprising the human SMN cDNA constitutively expressed under 
the control of a high-level expression promoter/regulatory sequence. The cells may be 
transfected with constructs which comprise SMN cDNA in either a sense (i.e., sense 
cells) or an antisense orientation (i.e., antisense cells). 

In another embodiment exemplified herein, the SMN modulating 
sequence was a plasmid wherein a sequence encoding a fusion protein comprising an 
HA tag polypeptide fused to the amino-terminus of chicken SMN (HA-cSMN) was 
expressed under a tetracycline repressible promoter. As the data disclosed herein 
demonstrate, removal of tetracycline allowed derepression of the promoter such that 
the HA-cSMN fusion was expressed in the cell (e.g., a chicken pre-B lymphoid DT40 
cell although other cell lines can be used). Without wishing to be bound by theory, 
expression of the HA-cSMN caused expression of endogenous cSMN to be inhibited 
such that only expression of HA-cSMN was detected in the cell. This apparent 
autoregulation of SMN expression, in turn, was lethal since the HA-cSMN did not 
replace endogenous wild type cSMN in the cell. Thus, one skilled in the art would 
appreciate, based upon the disclosure provided herein, that this cell, wherein a non- 
functional SMN replaces endogenous SMN, provides an important system analogous to 
a cell of an SMA patient in that the expression and, therefore, function of SMN are 
inhibited. Thus, this cell of the invention provides a useful system for the study of the 
role(s) of SMN in SMA and also for the identification of a compound useful for 
treatment of SMA which is characterized by the lack of endogenous expression of 
functional SMN. 

One skilled in the art would further appreciate that selected forms of 
SMN may be introduced to the cell of the invention in order to study the effect of any 
mutant, derivative, and variant of SMN (e.g., fusion proteins comprising at least a 
portion of SMN and a tag polypeptide) in this system.

Additionally, in another embodiment, the invention relates to a cell line 
transfected with empty vector only (i.e., vector-only cells also referred to as "control 
cells"). 

Additionally, the use of ribozymes to effect a reduction in the
expression of SMN in a cell line is contemplated, as is the use of any other means
which would effect a reduction in expression of SMN in a cell line such as the use of
knock-out and knock-in techniques as described elsewhere herein to affect SMN
expression in a cell.

One skilled in the art would appreciate that an antisense cell line, for
example, serves as an in vitro model for SMA in that these cells contain reduced
amounts of SMN, which is analogous to the diseased motor neurons of SMA patients.
Further, these cells can also demonstrate altered morphologies and growth
characteristics which are also similar to naturally occurring SMA cell pathology.
However, the invention is not limited to a cell line expressing lower levels of SMN
protein. Indeed, the invention also discloses methods of producing cell lines which
exhibit increased levels of SMN, i.e., cells transfected with vector encoding SMN in a
sense orientation under the control of a constitutive promoter/regulatory sequence
which drives expression of SMN protein at higher levels than that found in cells which
are not transfected or which are transfected with an empty vector only.

The invention is not limited to these cell lines or to any particular altered growth
characteristics. Rather, other cell lines may be developed using the methods described
herein to produce in vitro models of SMA. The identity of the cell line is not critical,
except that the cell line must exhibit reduced expression of SMN protein which is
analogous to the reduced level of expression exhibited by the cells of SMA patients.
Alternatively, the invention encompasses production of a cell that exhibits an increased
level of SMN protein expression compared with an otherwise identical cell which is
not altered in any way. Examples of cell lines which may produce proteins encoded by
a variety of expression vectors are found throughout the scientific literature. Also, the
manner of providing the SMN modulating sequence to the cell is not critical except
that, whether it is transcribed or translated, the nucleic acid must decrease or increase
the levels of SMN available to participate in mRNA biogenesis compared with
otherwise identical cells to which the SMN modulating sequence has not been
provided.

One skilled in the art will appreciate, based on the disclosure provided herein, that the
level of SMN protein in a cell can be decreased by reducing or inhibiting expression of
the SMN gene. Thus, the level of SMN protein in a cell can be decreased using a
molecule or compound that inhibits or reduces gene expression such as, for example,
antisense molecules or ribozymes.

In a preferred embodiment, the SMN modulating sequence is a SMN 
antisense nucleic acid sequence which is expressed by a plasmid vector and used to 
transfect a mammalian cell thereby causing reduced endogenous expression of SMN 
protein in the cells. However, as stated previously herein, the invention should not be 
construed to be limited to inhibiting expression of SMN by transfection of cells with 
antisense molecules. Rather, the invention encompasses other methods known in the 
art for inhibiting expression of SMN protein in cells including, but not limited to, the 
use of ribozymes, and expression of a non-functional SMN under the control of, for 
example, an inducible promoter, and the like. 

In another preferred embodiment, the transfected cells exhibit altered 
growth characteristics compared with cells which are either not transfected or which 
are transfected with an empty plasmid vector or with a vector containing SMN in a 
sense orientation. 

The invention should not be construed as being limited to a cell 
exhibiting any particular altered growth characteristics or to a cell exhibiting any 
particular combination of altered characteristics. Rather, the invention includes other
altered morphological or growth features and combinations thereof.

Further, the invention includes a recombinant cell comprising an 
antisense nucleic acid which cell is a useful model for the study of SMA and/or other 
conditions associated with or mediated by inhibition of mRNA biosynthesis and for 
elucidating the role(s) of SIP1, SMN, and/or Gemin3 in such processes. That is, the
lack of expression of SMN in SMA patients, among other things, indicated that SMN
was involved in SMA. Accordingly, a recombinant (i.e., transgenic) cell comprising an
antisense nucleic acid complementary to SMN is a useful tool for the study of the
mechanism(s) of action of SMN and its role(s) in the cell and for the identification of
therapeutics that ameliorate the effect(s) of decreased levels of SMN expression.






One skilled in the art will appreciate that one way to decrease the levels 
of SMN mRNA and/or protein in a cell is to inhibit expression of the nucleic acid 
encoding the protein. Expression of SMN may be inhibited using, for example, 
antisense molecules, nucleic acids encoding non-functional SMN, and also by using 
ribozymes.

Antisense molecules and their use for inhibiting gene expression are
well known in the art (see, e.g., Cohen, 1989, In: Oligodeoxyribonucleotides,
Antisense Inhibitors of Gene Expression, CRC Press). Antisense nucleic acids are
DNA or RNA molecules that are complementary, as that term is defined elsewhere
herein, to at least a portion of a specific mRNA molecule (Weintraub, 1990, Scientific
American 262:40). In the cell, antisense nucleic acids hybridize to the corresponding
mRNA, forming a double-stranded molecule thereby inhibiting the translation of
genes.

The use of antisense methods to inhibit the translation of genes is
known in the art, and is described, for example, in Marcus-Sekura (1988, Anal.
Biochem. 172:289). Such antisense molecules may be provided to the cell via genetic
expression using DNA encoding the antisense molecule as taught by Inoue, 1993, U.S.
Patent No. 5,190,931.

Alternatively, antisense molecules of the invention may be made
synthetically and then provided to the cell. Antisense oligomers of between about 10
and about 30, and more preferably about 15 nucleotides, are preferred, since they are
easily synthesized and introduced into a target cell. Synthetic antisense molecules
contemplated by the invention include oligonucleotide derivatives known in the art
which have improved biological activity compared to unmodified oligonucleotides (see
Cohen, supra; Tullis, 1991, U.S. Patent No. 5,023,243, incorporated by reference
herein in its entirety).

Ribozymes and their use for inhibiting gene expression are also well
known in the art (see, e.g., Cech et al., 1992, J. Biol. Chem. 267:17479-17482; Hampel
et al., 1989, Biochemistry 28:4929-4933; Eckstein et al., International Publication No.
WO 92/07065; Altman et al., U.S. Patent No. 5,168,053, incorporated by reference
herein in its entirety). Ribozymes are RNA molecules possessing the ability to
specifically cleave other single-stranded RNA in a manner analogous to DNA
restriction endonucleases. Through the modification of nucleotide sequences encoding
these RNAs, molecules can be engineered to recognize specific nucleotide sequences in
an RNA molecule and cleave it (Cech, 1988, J. Amer. Med. Assn. 260:3030). A major
advantage of this approach is that, because they are sequence-specific, only mRNAs
with particular sequences are inactivated.

There are two basic types of ribozymes, namely, tetrahymena-type
(Haseloff, 1988, Nature 334:585) and hammerhead-type. Tetrahymena-type
ribozymes recognize sequences which are four bases in length, while hammerhead-type
ribozymes recognize base sequences 11-18 bases in length. The longer the sequence,
the greater the likelihood that the sequence will occur exclusively in the target mRNA
species. Consequently, hammerhead-type ribozymes are preferable to tetrahymena-type
ribozymes for inactivating specific mRNA species, and 18-base recognition
sequences are preferable to shorter recognition sequences which may occur randomly
within various unrelated mRNA molecules.
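
The preference for longer recognition sequences can be made concrete with a rough
calculation. Assuming, as a simplification that is not stated in the disclosure, that
the four bases occur independently and with equal frequency, a given n-base
sequence is expected at any one position with probability 4^-n. The sketch below
estimates how many chance matches would be expected in a pool of unrelated mRNA;
the function name and the pool size are hypothetical.

```python
def expected_random_matches(site_length: int, pool_bases: int) -> float:
    """Expected chance occurrences of one specific site in pool_bases of RNA,
    assuming independent, equiprobable A/C/G/U at every position."""
    if site_length < 1 or pool_bases < site_length:
        raise ValueError("pool must be at least as long as the site")
    positions = pool_bases - site_length + 1  # possible start positions
    return positions * 0.25 ** site_length

POOL = 10_000_000  # hypothetical total bases of unrelated mRNA
for n in (4, 11, 18):
    print(n, expected_random_matches(n, POOL))
```

Under these assumptions, a four-base site is expected many times by chance in such
a pool, whereas an 18-base site is expected far less than once, which is consistent
with the preference stated above.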

Ribozymes useful for inhibiting the expression of SMN may be
designed by incorporating target sequences into the basic ribozyme structure which are
complementary to the mRNA sequence of the SMN encoded by SMN or having at least
about 80% homology to chicken SMN (SEQ ID NO:9). Ribozymes targeting SMN
may be synthesized using commercially available reagents (Applied Biosystems, Inc.,
Foster City, CA) or they may be genetically expressed from DNA encoding them.

The invention further includes a recombinant cell comprising an isolated
nucleic acid encoding SMN, SIP1, or Gemin3. The transgenic cell may be transiently
transfected with a plasmid encoding a portion of the nucleic acid encoding the protein
of interest, e.g., SMN. The nucleic acid need not be integrated into the cell genome
nor does it need to be expressed in the cell. Moreover, the cell may be a prokaryotic or
a eukaryotic cell and the invention should not be construed to be limited to any
particular cell line or cell type. Such cells include, but are not limited to, Xenopus
oocytes, chicken DT40 pre-B lymphoid cells, mammalian neuronal cells, E. coli, and
the like.

When the cell is a eukaryotic cell, the cell may be any eukaryotic cell
which, when the isolated nucleic acid of the invention is introduced therein, and the
protein encoded by the desired gene is no longer expressed therefrom, a benefit is
obtained. Such a benefit may include the fact that there has been provided a system in
which lack of expression of the desired gene can be studied in vitro in the laboratory or
in a mammal in which the cell resides, a system wherein cells comprising the
introduced gene deletion can be used as research, diagnostic and therapeutic tools, and
a system wherein animal models are generated which are useful for the development of
new diagnostic and therapeutic tools for selected disease states in a mammal including,
for example, SMA.

Alternatively, the invention includes a eukaryotic cell which, when the transgene of the
invention is introduced therein, and the protein encoded by the desired gene is
expressed therefrom where it was not previously present or expressed in the cell or
where it is now expressed at a level or under circumstances different than that before
the transgene was introduced, a benefit is obtained. Such a benefit may include the fact
that there has been provided a system wherein the expression of the desired gene can
be studied in vitro in the laboratory or in a mammal in which the cell resides, a system
wherein cells comprising the introduced gene can be used as research, diagnostic and
therapeutic tools, and a system wherein animal models are generated which are useful
for the development of new diagnostic and therapeutic tools for selected disease states
in a mammal (e.g., SMA).

The present invention is not limited to a cell line wherein expression of
SMN is affected by introduction of an SMN modulating sequence to the cell. That is,
one skilled in the art would appreciate, based on the disclosure provided herein, that
there are other methods of inhibiting or reducing expression of a nucleic acid encoding
a protein of interest (e.g., SMN, SIP1, and Gemin3). Thus, the invention further
includes a cell line wherein the endogenous nucleic acid encoding the protein of
interest has been removed, either in part or in its entirety, from the cell chromosome.
Further, the invention includes a cell line where the endogenous nucleic acid encoding
a protein has been removed where the cell further comprises a vector comprising a
nucleic acid encoding the protein such that the expression of the protein in such cell
can be controlled.

Therefore, the present invention includes a cell line wherein endogenous SMN
expression has been ablated or significantly reduced and, instead, inducible SMN
expression from a stably transfected plasmid has been provided to the cell. One
skilled in the art would appreciate, based upon the disclosure provided herein, that this
stable genetic system provides an important tool for the study of the effect of SMN
expression in a cell, the role(s) of SMN and proteins associated therewith in cell
processes, and for the identification of therapeutics useful for the treatment of SMA.

One of ordinary skill would appreciate, based upon the disclosure
provided herein, that a targeting vector useful for inserting (i.e., a "knock-in" targeting
vector) or deleting (i.e., a "knock-out" targeting vector) nucleic acid sequences of
interest in a cell comprises at least two sequences homologous to two portions of the
nucleic acid which is to be deleted or replaced. In the case of the knock-out targeting
vector, the two sequences are homologous with sequences that flank the gene; that is,
one sequence is homologous with a region at or near the 5' portion of the coding
sequence of the nucleic acid encoding, for example, SMN, and the other sequence is
further downstream from the first. One skilled in the art would appreciate, based upon
the disclosure provided herein, that the present invention is not limited to any specific
flanking nucleic acid sequences. Instead, the targeting vector may comprise two
sequences which remove some or all (in the case of a "knock-out" vector) or which
insert (in the case of a "knock-in" vector) a nucleic acid encoding SMN, or a fragment
thereof, into a mammalian genome. The crucial feature of the targeting vector is that it
comprise sufficient portions of two sequences located towards opposite, i.e., 5' and 3',
ends of the SMN open reading frame (ORF) such as to allow homologous
recombination to occur such that all or a portion of the nucleic acid encoding SMN is
deleted from a location on a mammalian chromosome. Alternatively, one of ordinary
skill in the art would appreciate that a knock-in targeting vector preferably comprises
sequences homologous to and flanking a location on a mammalian chromosome which
will be exchanged with the gene to be inserted (e.g., SMN, SIP1, and Gemin3). The
design of transgenes and knock-in and knock-out targeting vectors is well-known in the
art and is described in standard treatises such as Sambrook et al. (1989, Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York), and in
Ausubel et al. (1997, Current Protocols in Molecular Biology, John Wiley & Sons,
New York), and the like. The upstream and downstream portions flanking or within
the chicken SMN coding region, or sequences flanking a region to be replaced with
SMN by a knock-in vector, to be used in the targeting vector may be easily selected
based upon known methods and the disclosure provided herein, including the nucleic
and amino acid sequences of chicken SMN. Armed with these sequences, one of
ordinary skill in the art would be able to construct the transgenes and knock-out
vectors of the invention.

The invention further includes a knock-out targeting vector comprising
a nucleic acid encoding a selectable marker such as, for example, a nucleic acid
encoding the neo gene thereby allowing the selection of a transgenic (i.e.,
recombinant) cell where the nucleic acid encoding SMN, or a portion thereof, has been
deleted and replaced with the neomycin resistance gene thereby conferring on the
recombinant cell the ability to grow in the presence of G418. However, the present
invention should not be construed to be limited to neomycin resistance as a selectable
marker. Rather, other selectable markers well-known in the art may be used in the
knock-out targeting vector to allow selection of transgenic cells where the SMN gene
has been deleted and/or inactivated and replaced by the nucleic acid encoding the
selectable marker of choice. Methods of selecting and incorporating a selectable
marker into a vector are well-known in the art and are described in, for example,
Sambrook et al. (1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory, New York), and in Ausubel et al. (1997, Current Protocols in Molecular
Biology, John Wiley & Sons, New York).
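
The replacement of a target region by a selectable marker through two flanking
homologous sequences can be pictured with a toy string model. The Python sketch
below is only a conceptual illustration: the sequences, the names, and the "[neo]"
placeholder are hypothetical, and actual homologous recombination depends on the
length of the homologous sequences and on cellular machinery that a string
substitution does not capture.

```python
def knock_out(chromosome: str, arm5: str, arm3: str, marker: str) -> str:
    """Replace the region between two homology arms with a marker cassette."""
    i = chromosome.find(arm5)
    if i < 0:
        raise ValueError("5' homology arm not found")
    region_start = i + len(arm5)
    j = chromosome.find(arm3, region_start)
    if j < 0:
        raise ValueError("3' homology arm not found downstream of the 5' arm")
    # Keep both arms and swap everything between them for the marker.
    return chromosome[:region_start] + marker + chromosome[j:]

# Hypothetical toy locus: lower-case letters stand for the targeted coding region.
locus = "GGATCCAAGCTT" + "atgaaacccgggtag" + "CTCGAGTCTAGA"
edited = knock_out(locus, arm5="AAGCTT", arm3="CTCGAG", marker="[neo]")
print(edited)
```

In this model, only a cell whose locus carries the marker in place of the targeted
region would survive selection, which corresponds to growth in the presence of
G418 when the marker is the neomycin resistance gene.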
Antibodies 






The invention also includes an antibody that specifically binds SIP1, or
a fragment thereof. In one aspect, the antibody specifically binds a protein having the
amino acid sequence of at least one of SEQ ID NO:2, SEQ ID NO:4, and SEQ ID
NO:6, or a protein sharing about 20% homology with a protein having those sequences.
In one embodiment disclosed herein, the antibody is a murine monoclonal antibody
(e.g., 2S7, 2E17). However, the present invention is not limited to the particular
antibodies exemplified herein nor is the invention limited to monoclonal antibodies.
Rather, the invention encompasses polyclonal, monoclonal, synthetic antibodies, and
the like. One skilled in the art would understand, based upon the disclosure provided
herein, that the crucial feature of the antibody of the invention is that the antibody bind
specifically with SIP1 in a manner similar to those antibodies disclosed herein. More
specifically, the antibody of the invention recognizes SIP1 (e.g., human, Xenopus, and
mouse), or a fragment thereof, on Western blots, in immunostaining of cells, and
immunoprecipitates SIP1 using standard methods well-known in the art. Moreover,
selected antibodies of the invention can inhibit the mRNA splicing reaction and/or the
formation of the SMN-SIP1 complex in vivo and/or in vitro.

In addition, the invention includes an antibody that specifically binds
with Gemin3, or a fragment thereof. In one aspect, the antibody specifically binds a
protein having the amino acid sequence of SEQ ID NO:10. In another aspect, the
antibody specifically binds with a protein sharing about 20% homology with a protein
having the sequence SEQ ID NO:10, or a fragment thereof.

In one embodiment disclosed herein, the antibody is a murine
monoclonal antibody (e.g., 11G9, 12H12). However, the present invention is not
limited to the particular antibodies exemplified herein nor is the invention limited to
monoclonal antibodies. Rather, the invention encompasses polyclonal, monoclonal,
synthetic antibodies, and the like. One skilled in the art would understand, based upon
the disclosure provided herein, that the crucial feature of the antibody of the invention
is that it bind specifically with Gemin3 in a manner similar to the antibodies disclosed
herein. More specifically, the antibody of the invention recognizes Gemin3, or a
fragment thereof, on Western blots, in immunostaining of cells, and
immunoprecipitates Gemin3 using standard methods well-known in the art.

In addition, the invention includes an antibody that specifically binds with SMN, or a
fragment thereof. In one aspect, the antibody specifically binds a protein having the
amino acid sequence of SEQ ID NO:8. In another aspect, the antibody specifically
binds with a protein sharing about 20% homology with a protein having the sequence
SEQ ID NO:8, or a fragment thereof.

In one embodiment disclosed herein, the antibody is a murine
monoclonal antibody (e.g., 2B1) that specifically binds with chicken SMN. However,
the present invention is not limited to the particular antibody exemplified herein nor is
the invention limited to monoclonal antibodies. Rather, the invention encompasses
polyclonal, monoclonal, synthetic antibodies, and the like. One skilled in the art would
appreciate, based upon the disclosure provided herein, that the crucial feature of the
antibody of the invention is that it bind specifically with SMN in a manner similar to
those antibodies disclosed herein. More specifically, the antibody of the invention
recognizes human SMN, or a fragment thereof, on Western blots, in immunostaining of
cells, and immunoprecipitates SMN using standard methods well-known in the art.
Further, preincubation of a splicing extract with the antibody can also inhibit pre-mRNA
splicing as disclosed elsewhere herein.

In addition, the invention includes an antibody to chicken SMN. Such
antibody can be a polyclonal antibody, a monoclonal antibody, and the like, as
previously described elsewhere herein.

The antibodies of the invention are useful for the diagnosis, assessment
and treatment of SMA. These antibodies are also useful for elucidating the
mechanism(s) for pre-mRNA splicing and the causal relationship between such
processes and SMA. Further, the antibodies can be used to localize the relevant protein
in a cell and to study the role(s) of the antigen recognized thereby in cell processes.
Moreover, the antibodies can be used to detect and/or measure the amount of protein
present in a biological sample using well-known methods such as, but not limited to,
Western blotting and enzyme-linked immunosorbent assay (ELISA). Moreover, the
antibodies can be used to immunoprecipitate and/or immuno-affinity purify their
cognate antigen as described elsewhere herein.

The generation of polyclonal antibodies is accomplished by inoculating
the desired animal with the antigen and isolating antibodies which specifically bind the
antigen therefrom using standard antibody production methods such as those described
in, for example, Harlow et al. (1988, In: Antibodies, A Laboratory Manual, Cold
Spring Harbor, NY).

Monoclonal antibodies directed against full length or peptide fragments
of a protein or peptide can be prepared using any well known monoclonal antibody
preparation procedures, such as those described, for example, in Harlow et al., 1988,
supra, and in Tuszynski et al. (1988, Blood, 72:109-115), and methods set forth
elsewhere herein. Quantities of the desired peptide may also be synthesized using
chemical synthesis technology. Alternatively, DNA encoding the desired peptide may
be cloned and expressed from an appropriate promoter sequence in cells suitable for
the generation of large quantities of peptide. Monoclonal antibodies directed against
the peptide are generated from mice immunized with the peptide using standard
procedures as referenced herein.

Nucleic acid encoding the monoclonal antibody obtained using the
procedures described herein may be cloned and sequenced using technology which is
available in the art, and is described, for example, in Wright et al. (1992, Critical Rev.
Immunol. 12:125-168), and the references cited therein. Further, the antibody of the
invention may be "humanized" using the technology described in Wright et al. (supra),
and in the references cited therein, and in Gu et al. (1997, Thrombosis and Haemostasis
77:755-759).

To generate a phage antibody library, a cDNA library is first obtained
from mRNA which is isolated from cells, e.g., the hybridoma, which express the
desired protein to be expressed on the phage surface, e.g., the desired antibody. cDNA
copies of the mRNA are produced using reverse transcriptase. cDNA which specifies
immunoglobulin fragments is obtained by PCR and the resulting DNA is cloned into
a suitable bacteriophage vector to generate a bacteriophage DNA library comprising
DNA specifying immunoglobulin genes. The procedures for making a bacteriophage
library comprising heterologous DNA are well known in the art and are described, for
example, in Sambrook et al., supra.

Bacteriophage which encode the desired antibody may be engineered
such that the protein is displayed on the surface thereof in such a manner that it is
available for binding to its corresponding binding protein, e.g., the antigen against
which the antibody is directed. Thus, when bacteriophage which express a specific
antibody are incubated in the presence of a cell which expresses the corresponding
antigen, the bacteriophage will bind to the cell. Bacteriophage which do not express
the antibody will not bind to the cell. Such panning techniques are well known in the
art and are described, for example, in Wright et al. (supra).

Processes such as those described above have been developed for the
production of human antibodies using M13 bacteriophage display (Burton et al., 1994,
Adv. Immunol. 57:191-280). Essentially, a cDNA library is generated from mRNA
obtained from a population of antibody-producing cells. The mRNA encodes
rearranged immunoglobulin genes and thus, the cDNA encodes the same. Amplified
cDNA is cloned into M13 expression vectors creating a library of phage which express
human Fab fragments on their surface. Phage which display the antibody of interest
are selected by antigen binding and are propagated in bacteria to produce soluble
human Fab immunoglobulin. Thus, in contrast to conventional monoclonal antibody
synthesis, this procedure immortalizes DNA encoding human immunoglobulin rather
than cells which express human immunoglobulin.

The procedures just presented describe the generation of phage which
encode the Fab portion of an antibody molecule. However, the invention should not be
construed to be limited solely to the generation of phage encoding Fab antibodies.
Rather, phage which encode single chain antibodies (scFv/phage antibody libraries) are
also included in the invention. Fab molecules comprise the entire Ig light chain, that is,
they comprise both the variable and constant region of the light chain, but include only
the variable region and first constant region domain (CH1) of the heavy chain. Single
chain antibody molecules comprise a single chain of protein comprising the Ig Fv
fragment. An Ig Fv fragment includes only the variable regions of the heavy and light
chains of the antibody, having no constant region contained therein. Phage libraries
comprising scFv DNA may be generated following the procedures described in Marks
et al. (1991, J. Mol. Biol. 222:581-597). Panning of phage so generated for the
isolation of a desired antibody is conducted in a manner similar to that described for
phage libraries comprising Fab DNA.

The invention should also be construed to include synthetic phage
display libraries in which the heavy and light chain variable regions may be
synthesized such that they include nearly all possible specificities (Barbas, 1995,
Nature Medicine 1:837-839; de Kruif et al., 1995, J. Mol. Biol. 248:97-105).
Compositions 

The invention includes a composition comprising an isolated purified
polypeptide comprising a eukaryotic SIP1 (e.g., human SIP1, frog SIP1, and yeast
homolog Brr1). Preferably, the composition comprises a pharmaceutically acceptable
carrier.

The invention includes a composition comprising an isolated purified
SMN and a protein that specifically binds SMN (e.g., another SMN, SIP1, Gemin3,
and various Sm proteins). The composition comprises the afore-mentioned proteins in
any combination or permutation thereof. In one aspect, the composition further
comprises a ribonucleic acid, such as, but not limited to, snRNA. The composition and
the various permutations thereof are an important tool for elucidating the precise
structure of the mRNA splicing complex and the mechanism(s) involved in mRNA
biosynthesis.

Also included in the invention is a composition comprising an antibody
that specifically binds SMN. Preferably, the composition comprises a
pharmaceutically-acceptable carrier. Such an antibody can be administered to a SMA
patient since the data disclosed herein demonstrate that addition of anti-SMN mAb 2B1
increases aggregation of SMN, which is deficient in SMA-afflicted humans. That is, the
SMN of SMA patients exhibits a decreased ability to aggregate with itself and with
SIP1, indicating that such aggregation/binding is important and that compounds that
increase such protein interactions should ameliorate or treat SMA by increasing such
important interaction(s).

The invention further includes a composition comprising an isolated and
purified SIP1 wherein the composition further comprises a pharmaceutically-acceptable
carrier. Such a composition can be used to immunize a mammal in order to
generate antibodies that specifically bind SIP1. Further, the composition can be used
to administer SIP1 to a mammal whereby a condition mediated by or associated with a
decreased amount of SIP1 is ameliorated or treated by the administration of the protein
to the mammal.

The invention includes a composition comprising an isolated purified
SMN and an isolated purified SIP1. Such a novel composition, which is disclosed
elsewhere herein, is a useful tool for the study of the interaction between SMN and SIP1,
which is deficient in SMA patients, thereby indicating that such protein-protein
interaction is important in SMA. Further, the data disclosed herein demonstrate that
SMN and SIP1 associate in vivo as part of a large molecular weight complex of about
800 kDa, which complex is involved in mRNA biosynthesis. Thus, elucidation of the
nature of the interactions between these proteins is important in the study of mRNA
biosynthesis. In addition, the composition is useful in molecular protein modeling
assays (e.g., crystallography, biosensor analyses, and the like) to determine the
precise configuration of the complex and further characterize the association kinetics of
the complex and to aid in the elucidation of the precise mechanism(s) involved in
mRNA biosynthesis.

The invention also includes a composition comprising an isolated
purified SMN and an isolated purified Gemin3. The data disclosed herein demonstrate,
for the first time, that SMN and Gemin3 associate in vivo as part of a high molecular
weight complex. Thus, the study of the interaction between SMN and Gemin3 is
important in elucidating SMA disease processes and in studying the role(s) and
interaction(s) of these proteins with respect to mRNA biosynthesis. In addition, the
composition is useful in molecular protein characterization assays (e.g.,
crystallography, biosensor analyses, and the like) to determine the precise
configuration of the proteins within the complex and to further characterize the
association kinetics of the proteins and to aid in the elucidation of the precise
mechanism(s) involved in mRNA biosynthesis.

For administration of the above-mentioned compositions to a mammal, a
polypeptide, or the nucleic acid encoding it, or both, can be suspended in any
pharmaceutically acceptable carrier, for example, HEPES buffered saline at a pH of
about 7.8. Other pharmaceutically acceptable carriers which are useful include, but are
not limited to, glycerol, water, saline, ethanol and other pharmaceutically acceptable
salt solutions such as phosphates and salts of organic acids. Examples of these and
other pharmaceutically acceptable carriers are described in Remington's
Pharmaceutical Sciences (1991, Mack Publication Co., New Jersey).

The pharmaceutical compositions may be prepared, packaged, or sold in 
the form of a sterile injectable aqueous or oily suspension or solution. This suspension 
or solution may be formulated according to the known art, and may comprise, in
addition to the active ingredient, additional ingredients such as the dispersing agents,
wetting agents, or suspending agents described herein. Such sterile injectable
formulations may be prepared using a non-toxic parenterally-acceptable diluent or
solvent, such as water or 1,3-butane diol, for example. Other acceptable diluents and
solvents include, but are not limited to, Ringer's solution, isotonic sodium chloride
solution, and fixed oils such as synthetic mono- or di-glycerides.

Pharmaceutical compositions that are useful in the methods of the 
invention may be administered, prepared, packaged, and/or sold in formulations 
suitable for oral, rectal, vaginal, parenteral, topical, pulmonary, intranasal, buccal, 
ophthalmic, or another route of administration. Other contemplated formulations
include projected nanoparticles, liposomal preparations, resealed erythrocytes
containing the active ingredient, and immunologically-based formulations.

The compositions of the invention may be administered via numerous 
routes, including, but not limited to, oral, rectal, vaginal, parenteral, topical, 
pulmonary, intranasal, buccal, or ophthalmic administration routes. The route(s) of
administration will be readily apparent to the skilled artisan and will depend upon any
number of factors including the type and severity of the disease being treated, the type 
and age of the veterinary or human patient being treated, and the like. 

Pharmaceutical compositions that are useful in the methods of the 
invention may be administered systemically in oral solid formulations, ophthalmic,
suppository, aerosol, topical or other similar formulations. In addition to the
compound such as heparan sulfate, or a biological equivalent thereof, such
pharmaceutical compositions may contain pharmaceutically-acceptable carriers and
other ingredients known to enhance and facilitate drug administration. Other possible
formulations, such as nanoparticles, liposomes, resealed erythrocytes, and
immunologically based systems may also be used to administer SMN, alone or in
combination with at least one protein specifically associated with SMN (e.g., another
SMN, SIP1, Gemin3, and Sm proteins), and/or a nucleic acid encoding the same.
In addition, any compound identified using any of the methods described herein can be 
formulated and administered to a mammal for treatment of SMA, and/or any disease 
or condition associated with a defect in mRNA biosynthesis. 

The invention encompasses the preparation and use of pharmaceutical 
compositions comprising a compound useful for treatment of SMA identified using a 
method of the invention as an active ingredient. Such a pharmaceutical composition 
may consist of the active ingredient alone, in a form suitable for administration to a 
subject, or the pharmaceutical composition may comprise the active ingredient and one 
or more pharmaceutically acceptable carriers, one or more additional ingredients, or 
some combination of these. The active ingredient may be present in the 
pharmaceutical composition in the form of a physiologically acceptable ester or salt, 
such as in combination with a physiologically acceptable cation or anion, as is well 
known in the art. 

The formulations of the pharmaceutical compositions described herein 
may be prepared by any method known or hereafter developed in the art of 
pharmacology. In general, such preparatory methods include the step of bringing the 
active ingredient into association with a carrier or one or more other accessory
ingredients, and then, if necessary or desirable, shaping or packaging the product into a 
desired single- or multi-dose unit. 

Although the descriptions of pharmaceutical compositions provided 
herein are principally directed to pharmaceutical compositions which are suitable for 
ethical administration to humans, it will be understood by the skilled artisan that such
compositions are generally suitable for administration to animals of all sorts.
Modification of pharmaceutical compositions suitable for administration to humans in
order to render the compositions suitable for administration to various animals is well
understood, and the ordinarily skilled veterinary pharmacologist can design and
perform such modification with merely ordinary, if any, experimentation. Subjects to
which administration of the pharmaceutical compositions of the invention is
contemplated include, but are not limited to, humans and other primates, mammals
including commercially relevant mammals such as cattle, pigs, horses, sheep, cats, and 
dogs. In addition, the administration of the compositions to birds is also contemplated. 

Pharmaceutical compositions that are useful in the methods of the
invention may be prepared, packaged, or sold in formulations suitable for oral, rectal,
vaginal, parenteral, topical, pulmonary, intranasal, buccal, ophthalmic, intrathecal or
another route of administration. Other contemplated formulations include projected
nanoparticles, liposomal preparations, resealed erythrocytes containing the active
ingredient, and immunologically-based formulations.

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in bulk, as a single unit dose, or as a plurality of single unit doses. 
As used herein, a "unit dose" is a discrete amount of the pharmaceutical composition
comprising a predetermined amount of the active ingredient. The amount of the active
ingredient is generally equal to the dosage of the active ingredient which would be
administered to a subject or a convenient fraction of such a dosage such as, for 
example, one-half or one-third of such a dosage. 
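
The relationship between a dosage and a unit dose described above can be illustrated with a short calculation. The following Python sketch is offered for illustration only; the function name `unit_dose_amount`, its parameters, and the example dosage of 300 mg are hypothetical and do not appear in the specification.

```python
def unit_dose_amount(dosage_mg: float, doses_per_dosage: int = 1) -> float:
    """Amount of active ingredient (mg) in one unit dose.

    Hypothetical helper: a unit dose holds either the full dosage
    (doses_per_dosage=1) or a convenient fraction of it, e.g. one-half
    (doses_per_dosage=2) or one-third (doses_per_dosage=3).
    """
    if dosage_mg <= 0:
        raise ValueError("dosage_mg must be positive")
    if doses_per_dosage < 1:
        raise ValueError("doses_per_dosage must be at least 1")
    return dosage_mg / doses_per_dosage


if __name__ == "__main__":
    # Assumed example dosage of 300 mg; not a value from the specification.
    for n in (1, 2, 3):
        print(f"1/{n} of the dosage: {unit_dose_amount(300, n)} mg per unit dose")
```

Under these assumptions, a dosage split into thirds would simply require three unit doses to deliver the full dosage.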

The relative amounts of the active ingredient, the pharmaceutically
acceptable carrier, and any additional ingredients in a pharmaceutical composition of
the invention will vary, depending upon the identity, size, and condition of the subject
treated and further depending upon the route by which the composition is to be
administered. By way of example, the composition may comprise between 0.1% and
100% (w/w) active ingredient. 

In addition to the active ingredient, a pharmaceutical composition of the 
invention may further comprise one or more additional pharmaceutically active agents. 
Particularly contemplated additional agents include anti-emetics and scavengers such 
as cyanide and cyanate scavengers. 

Controlled- or sustained-release formulations of a pharmaceutical 
composition of the invention may be made using conventional technology.

A formulation of a pharmaceutical composition of the invention suitable for oral
administration may be prepared, packaged, or sold in the form of a discrete solid dose
unit including, but not limited to, a tablet, a hard or soft capsule, a cachet, a troche, or a 
lozenge, each containing a predetermined amount of the active ingredient. Other 
formulations suitable for oral administration include, but are not limited to, a powdered 
or granular formulation, an aqueous or oily suspension, an aqueous or oily solution, or 
an emulsion. 

As used herein, an "oily" liquid is one which comprises a carbon-containing
liquid molecule and which exhibits a less polar character than water.

A tablet comprising the active ingredient may, for example, be made by compressing
or molding the active ingredient, optionally with one or more additional ingredients. 
Compressed tablets may be prepared by compressing, in a suitable device, the active 
ingredient in a free-flowing form such as a powder or granular preparation, optionally 
mixed with one or more of a binder, a lubricant, an excipient, a surface active agent, 
and a dispersing agent. Molded tablets may be made by molding, in a suitable device, 
a mixture of the active ingredient, a pharmaceutically acceptable carrier, and at least 
sufficient liquid to moisten the mixture. Pharmaceutically acceptable excipients used
in the manufacture of tablets include, but are not limited to, inert diluents, granulating
and disintegrating agents, binding agents, and lubricating agents. Known dispersing
agents include, but are not limited to, potato starch and sodium starch glycollate.
Known surface active agents include, but are not limited to, sodium lauryl sulphate.
Known diluents include, but are not limited to, calcium carbonate, sodium carbonate, 
lactose, microcrystalline cellulose, calcium phosphate, calcium hydrogen phosphate, 
and sodium phosphate. Known granulating and disintegrating agents include, but are 
not limited to, corn starch and alginic acid. Known binding agents include, but are not
limited to, gelatin, acacia, pre-gelatinized maize starch, polyvinylpyrrolidone, and
hydroxy propyl methyl cellulose. Known lubricating agents include, but are not 
limited to, magnesium stearate, stearic acid, silica, and talc. 

Tablets may be non-coated or they may be coated using known methods 
to achieve delayed disintegration in the gastrointestinal tract of a subject, thereby
providing sustained release and absorption of the active ingredient. By way of
example, a material such as glyceryl monostearate or glyceryl distearate may be used to
coat tablets. Further by way of example, tablets may be coated using methods
described in U.S. Patents numbers 4,256,108; 4,160,452; and 4,265,874 to form
osmotically-controlled release tablets. Tablets may further comprise a sweetening
agent, a flavoring agent, a coloring agent, a preservative, or some combination of these
in order to provide a pharmaceutically elegant and palatable preparation.

Hard capsules comprising the active ingredient may be made using a 
physiologically degradable composition, such as gelatin. Such hard capsules comprise 
the active ingredient, and may further comprise additional ingredients including, for
example, an inert solid diluent such as calcium carbonate, calcium phosphate, or
kaolin. 

Soft gelatin capsules comprising the active ingredient may be made 
using a physiologically degradable composition, such as gelatin. Such soft capsules 
comprise the active ingredient, which may be mixed with water or an oil medium such
as peanut oil, liquid paraffin, or olive oil.

Liquid formulations of a pharmaceutical composition of the invention 
which are suitable for oral administration may be prepared, packaged, and sold either 
in liquid form or in the form of a dry product intended for reconstitution with water or 
another suitable vehicle prior to use. 






Liquid suspensions may be prepared using conventional methods to 
achieve suspension of the active ingredient in an aqueous or oily vehicle. Aqueous 
vehicles include, for example, water and isotonic saline. Oily vehicles include, for 
example, almond oil, oily esters, ethyl alcohol, vegetable oils such as arachis, olive, 
sesame, or coconut oil, fractionated vegetable oils, and mineral oils such as liquid
paraffin. Liquid suspensions may further comprise one or more additional ingredients
including, but not limited to, suspending agents, dispersing or wetting agents,
emulsifying agents, demulcents, preservatives, buffers, salts, flavorings, coloring
agents, and sweetening agents. Oily suspensions may further comprise a thickening
agent. Known suspending agents include, but are not limited to, sorbitol syrup,
hydrogenated edible fats, sodium alginate, polyvinylpyrrolidone, gum tragacanth, gum
acacia, and cellulose derivatives such as sodium carboxymethylcellulose, methyl
cellulose, and hydroxypropylmethylcellulose. Known dispersing or wetting agents include,
but are not limited to, naturally-occurring phosphatides such as lecithin, condensation
products of an alkylene oxide with a fatty acid, with a long chain aliphatic alcohol,
with a partial ester derived from a fatty acid and a hexitol, or with a partial ester
derived from a fatty acid and a hexitol anhydride (e.g., polyoxyethylene stearate,
heptadecaethyleneoxycetanol, polyoxyethylene sorbitol monooleate, and
polyoxyethylene sorbitan monooleate, respectively). Known emulsifying agents
include, but are not limited to, lecithin and acacia. Known preservatives include, but
are not limited to, methyl, ethyl, or n-propyl-para-hydroxybenzoates, ascorbic acid, and
sorbic acid. Known sweetening agents include, for example, glycerol, propylene 
glycol, sorbitol, sucrose, and saccharin. Known thickening agents for oily suspensions 
include, for example, beeswax, hard paraffin, and cetyl alcohol. 

Liquid solutions of the active ingredient in aqueous or oily solvents may
be prepared in substantially the same manner as liquid suspensions, the primary
difference being that the active ingredient is dissolved, rather than suspended in the
solvent. Liquid solutions of the pharmaceutical composition of the invention may
comprise each of the components described with regard to liquid suspensions, it being
understood that suspending agents will not necessarily aid dissolution of the active
ingredient in the solvent. Aqueous solvents include, for example, water and isotonic 
saline. Oily solvents include, for example, almond oil, oily esters, ethyl alcohol, 
vegetable oils such as arachis, olive, sesame, or coconut oil, fractionated vegetable oils, 
and mineral oils such as liquid paraffin. 

Powdered and granular formulations of a pharmaceutical preparation of 
the invention may be prepared using known methods. Such formulations may be 
administered directly to a subject, used, for example, to form tablets, to fill capsules, or 
to prepare an aqueous or oily suspension or solution by addition of an aqueous or oily 
vehicle thereto. Each of these formulations may further comprise one or more of
a dispersing or wetting agent, a suspending agent, and a preservative. Additional
excipients, such as fillers and sweetening, flavoring, or coloring agents, may also be 
included in these formulations. 

A pharmaceutical composition of the invention may also be prepared, 
packaged, or sold in the form of an oil-in-water emulsion or a water-in-oil emulsion. The
oily phase may be a vegetable oil such as olive or arachis oil, a mineral oil such as
liquid paraffin, or a combination of these. Such compositions may further comprise
one or more emulsifying agents such as naturally occurring gums such as gum acacia 
or gum tragacanth, naturally-occurring phosphatides such as soybean or lecithin 
phosphatide, esters or partial esters derived from combinations of fatty acids and 
hexitol anhydrides such as sorbitan monooleate, and condensation products of such 
partial esters with ethylene oxide such as polyoxyethylene sorbitan monooleate. These 
emulsions may also contain additional ingredients including, for example, sweetening 
or flavoring agents. 

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for rectal administration. Such a 
composition may be in the form of, for example, a suppository, a retention enema 
preparation, or a solution for rectal or colonic irrigation.

Suppository formulations may be made by combining the active 
ingredient with a non-irritating pharmaceutically acceptable excipient which is solid at 
ordinary room temperature (i.e., about 20°C) and which is liquid at the rectal
temperature of the subject (i.e., about 37°C in a healthy human). Suitable
pharmaceutically acceptable excipients include, but are not limited to, cocoa butter, 
polyethylene glycols, and various glycerides. Suppository formulations may further 
comprise various additional ingredients including, but not limited to, antioxidants and 
preservatives. 

Retention enema preparations or solutions for rectal or colonic irrigation 
may be made by combining the active ingredient with a pharmaceutically acceptable 
liquid carrier. As is well known in the art, enema preparations may be administered 
using, and may be packaged within, a delivery device adapted to the rectal anatomy of 
the subject. Enema preparations may further comprise various additional ingredients 
including, but not limited to, antioxidants and preservatives. 

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for vaginal administration. Such a 
composition may be in the form of, for example, a suppository, an impregnated or 
coated vaginally-insertable material such as a tampon, a douche preparation, a gel or
cream, or a solution for vaginal irrigation.

Methods for impregnating or coating a material with a chemical 
composition are known in the art, and include, but are not limited to methods of 
depositing or binding a chemical composition onto a surface, methods of incorporating 
a chemical composition into the structure of a material during the synthesis of the 
material (i.e., such as with a physiologically degradable material), and methods of
absorbing an aqueous or oily solution or suspension into an absorbent material, with or 
without subsequent drying. 

Douche preparations or solutions for vaginal irrigation may be made by 
combining the active ingredient with a pharmaceutically acceptable liquid carrier. As 
is well known in the art, douche preparations may be administered using, and may be 
packaged within, a delivery device adapted to the vaginal anatomy of the subject. 
Douche preparations may further comprise various additional ingredients including, 
but not limited to, antioxidants, antibiotics, antifungal agents, and preservatives. 






As used herein, "parenteral administration" of a pharmaceutical 
composition includes any route of administration characterized by physical breaching 
of a tissue of a subject and administration of the pharmaceutical composition through 
the breach in the tissue. Parenteral administration thus includes, but is not limited to, 
administration of a pharmaceutical composition by injection of the composition, by
application of the composition through a surgical incision, by application of the
composition through a tissue-penetrating non-surgical wound, and the like. In
particular, parenteral administration is contemplated to include, but is not limited to,
subcutaneous, intraperitoneal, intramuscular, intrasternal injection, and kidney dialytic
infusion techniques.

Formulations of a pharmaceutical composition suitable for parenteral 
administration comprise the active ingredient combined with a pharmaceutically 
acceptable carrier, such as sterile water or sterile isotonic saline. Such formulations 
may be prepared, packaged, or sold in a form suitable for bolus administration or for
continuous administration. Injectable formulations may be prepared, packaged, or sold
in unit dosage form, such as in ampules or in multi-dose containers containing a 
preservative. Formulations for parenteral administration include, but are not limited to, 
suspensions, solutions, emulsions in oily or aqueous vehicles, pastes, and implantable 
sustained-release or biodegradable formulations. Such formulations may further
comprise one or more additional ingredients including, but not limited to, suspending,
stabilizing, or dispersing agents. In one embodiment of a formulation for parenteral 
administration, the active ingredient is provided in dry (i.e., powder or granular) form 
for reconstitution with a suitable vehicle (e.g., sterile pyrogen-free water) prior to 
parenteral administration of the reconstituted composition. 

The pharmaceutical compositions may be prepared, packaged, or sold in
the form of a sterile injectable aqueous or oily suspension or solution. This suspension
or solution may be formulated according to the known art, and may comprise, in 
addition to the active ingredient, additional ingredients such as the dispersing agents, 
wetting agents, or suspending agents described herein. Such sterile injectable
formulations may be prepared using a non-toxic parenterally-acceptable diluent or
solvent, such as water or 1,3-butane diol, for example. Other acceptable diluents and 
solvents include, but are not limited to, Ringer's solution, isotonic sodium chloride 
solution, and fixed oils such as synthetic mono- or di-glycerides. Other parenterally-
administrable formulations which are useful include those which comprise the active
ingredient in microcrystalline form, in a liposomal preparation, or as a component of a
biodegradable polymer system. Compositions for sustained release or implantation
may comprise pharmaceutically acceptable polymeric or hydrophobic materials such as
an emulsion, an ion exchange resin, a sparingly soluble polymer, or a sparingly soluble
salt.

Formulations suitable for topical administration include, but are not 
limited to, liquid or semi-liquid preparations such as liniments, lotions, oil-in-water or 
water-in-oil emulsions such as creams, ointments or pastes, and solutions or 
suspensions. Topically-administrable formulations may, for example, comprise from
about 1% to about 10% (w/w) active ingredient, although the concentration of the 
active ingredient may be as high as the solubility limit of the active ingredient in the 
solvent. Formulations for topical administration may further comprise one or more of 
the additional ingredients described herein. 

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for pulmonary administration via the buccal 
cavity. Such a formulation may comprise dry particles which comprise the active 
ingredient and which have a diameter in the range from about 0.5 to about 7 
nanometers, and preferably from about 1 to about 6 nanometers. Such compositions 
are conveniently in the form of dry powders for administration using a device 
comprising a dry powder reservoir to which a stream of propellant may be directed to 
disperse the powder or using a self-propelling solvent/powder-dispensing container 
such as a device comprising the active ingredient dissolved or suspended in a low- 
boiling propellant in a sealed container. Preferably, such powders comprise particles 
wherein at least 98% of the particles by weight have a diameter greater than 0.5 
nanometers and at least 95% of the particles by number have a diameter less than 7
nanometers. More preferably, at least 95% of the particles by weight have a diameter
greater than 1 nanometer and at least 90% of the particles by number have a diameter 
less than 6 nanometers. Dry powder compositions preferably include a solid fine 
powder diluent such as sugar and are conveniently provided in a unit dose form. 
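
The preferred particle size criteria recited above combine a by-weight threshold with a by-number threshold. The following Python sketch illustrates how such a check might be carried out on a list of measured particle diameters. It is illustrative only: the function name `meets_size_criteria` is hypothetical, and the sketch assumes spherical particles of uniform density, so that each particle's weight is proportional to the cube of its diameter. That assumption is not stated in the specification.

```python
def meets_size_criteria(diameters, min_d=0.5, max_d=7.0,
                        min_weight_frac=0.98, min_number_frac=0.95):
    """Check a powder sample against the two preferred size criteria.

    diameters: particle diameters, in the same unit as min_d and max_d.
    Criterion 1: at least min_weight_frac of the particles BY WEIGHT
                 have a diameter greater than min_d.
    Criterion 2: at least min_number_frac of the particles BY NUMBER
                 have a diameter less than max_d.
    Assumption (not from the specification): spherical particles of
    uniform density, so weight is proportional to diameter cubed.
    """
    if not diameters or any(d <= 0 for d in diameters):
        raise ValueError("diameters must be a non-empty list of positive values")

    weights = [d ** 3 for d in diameters]  # relative weights only
    weight_above = sum(w for d, w in zip(diameters, weights) if d > min_d)
    number_below = sum(1 for d in diameters if d < max_d)

    weight_ok = weight_above / sum(weights) >= min_weight_frac
    number_ok = number_below / len(diameters) >= min_number_frac
    return weight_ok and number_ok


if __name__ == "__main__":
    sample = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]  # made-up measurements
    print(meets_size_criteria(sample))
    # The "more preferably" criteria use different thresholds:
    print(meets_size_criteria(sample, min_d=1.0, max_d=6.0,
                              min_weight_frac=0.95, min_number_frac=0.90))
```

The default arguments correspond to the preferred thresholds, and the second call passes the more preferred thresholds explicitly.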

Low boiling propellants generally include liquid propellants having a 
boiling point of below 65°F at atmospheric pressure. Generally the propellant may
constitute 50 to 99.9% (w/w) of the composition, and the active ingredient may 
constitute 0.1 to 20% (w/w) of the composition. The propellant may further comprise 
additional ingredients such as a liquid non-ionic or solid anionic surfactant or a solid 
diluent (preferably having a particle size of the same order as particles comprising the 
active ingredient). 

Pharmaceutical compositions of the invention formulated for pulmonary 
delivery may also provide the active ingredient in the form of droplets of a solution or 
suspension. Such formulations may be prepared, packaged, or sold as aqueous or 
dilute alcoholic solutions or suspensions, optionally sterile, comprising the active 
ingredient, and may conveniently be administered using any nebulization or 
atomization device. Such formulations may further comprise one or more additional 
ingredients including, but not limited to, a flavoring agent such as saccharin sodium, a 
volatile oil, a buffering agent, a surface active agent, or a preservative such as 
methylhydroxybenzoate. The droplets provided by this route of administration 
preferably have an average diameter in the range from about 0.1 to about 200 
nanometers. 

The formulations described herein as being useful for pulmonary 
delivery are also useful for intranasal delivery of a pharmaceutical composition of the
invention. 

Another formulation suitable for intranasal administration is a coarse powder 
comprising the active ingredient and having an average particle size from about 0.2 to 500
micrometers. Such a formulation is administered in the manner in which snuff is
taken, i.e., by rapid inhalation through the nasal passage from a container of the powder
held close to the nares. 






Formulations suitable for nasal administration may, for example, 
comprise from about as little as 0.1% (w/w) to as much as 100% (w/w) of the active
ingredient, and may further comprise one or more of the additional ingredients 
described herein. 

A pharmaceutical composition of the invention may be prepared,
packaged, or sold in a formulation suitable for buccal administration. Such
formulations may, for example, be in the form of tablets or lozenges made using
conventional methods, and may, for example, comprise 0.1 to 20% (w/w) active ingredient, the
balance comprising an orally dissolvable or degradable composition and, optionally,
one or more of the additional ingredients described herein. Alternately, formulations
suitable for buccal administration may comprise a powder or an aerosolized or
atomized solution or suspension comprising the active ingredient. Such powdered,
aerosolized, or atomized formulations, when dispersed, preferably have an average
particle or droplet size in the range from about 0.1 to about 200 nanometers, and may
further comprise one or more of the additional ingredients described herein.

A pharmaceutical composition of the invention may be prepared, 
packaged, or sold in a formulation suitable for ophthalmic administration. Such 
formulations may, for example, be in the form of eye drops including, for example, a 
0.1-1.0% (w/w) solution or suspension of the active ingredient in an aqueous or oily
liquid carrier. Such drops may further comprise buffering agents, salts, or one or more
other of the additional ingredients described herein. Other ophthalmically-
administrable formulations which are useful include those which comprise the active
ingredient in microcrystalline form or in a liposomal preparation. 

As used herein, "additional ingredients" include, but are not limited to,
one or more of the following: excipients; surface active agents; dispersing agents; inert
diluents; granulating and disintegrating agents; binding agents; lubricating agents;
sweetening agents; flavoring agents; coloring agents; preservatives; physiologically
degradable compositions such as gelatin; aqueous vehicles and solvents; oily vehicles
and solvents; suspending agents; dispersing or wetting agents; emulsifying agents;
demulcents; buffers; salts; thickening agents; fillers; antioxidants;
antibiotics; antifungal agents; stabilizing agents; and pharmaceutically acceptable 
polymeric or hydrophobic materials. Other "additional ingredients" which may be 
included in the pharmaceutical compositions of the invention are known in the art and 
described, for example, in Genaro, ed. (1985, Remington's Pharmaceutical Sciences,
Mack Publishing Co., Easton, PA), which is incorporated herein by reference.

Typically, dosages of the compound of the invention which may be
administered to an animal, preferably a human, range in amount from 1 microgram to
about 100 grams per kilogram of body weight of the animal. The precise dosage
administered will vary depending upon any number of factors, including but not
limited to, the type of animal and type of disease state being treated, the age of the
animal and the route of administration. Preferably, the dosage of the compound will
vary from about 1 milligram to about 10 grams per kilogram of body weight of the 
animal. More preferably, the dosage will vary from about 10 milligrams to about 1 
gram per kilogram of body weight of the animal. 
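
The three dosage ranges recited above are stated per kilogram of body weight, so the corresponding total amount for a given animal scales with its weight. The following Python sketch illustrates the arithmetic only. The names `DOSAGE_RANGES_UG_PER_KG` and `total_dose_range_ug`, the tier labels, and the 70 kg example weight are hypothetical; the numeric ranges are those recited in the preceding paragraph, expressed in micrograms.

```python
# Ranges from the text, converted to micrograms per kilogram of body weight:
#   typical:        1 microgram  to 100 grams
#   preferred:      1 milligram  to 10 grams
#   more preferred: 10 milligrams to 1 gram
DOSAGE_RANGES_UG_PER_KG = {
    "typical": (1, 100_000_000),
    "preferred": (1_000, 10_000_000),
    "more_preferred": (10_000, 1_000_000),
}


def total_dose_range_ug(body_weight_kg, tier="typical"):
    """Return (low, high) total dose in micrograms for the given tier.

    Hypothetical helper for illustration; tier labels are not terms
    used in the specification.
    """
    if body_weight_kg <= 0:
        raise ValueError("body_weight_kg must be positive")
    low, high = DOSAGE_RANGES_UG_PER_KG[tier]
    return (low * body_weight_kg, high * body_weight_kg)


if __name__ == "__main__":
    # Assumed 70 kg body weight, chosen only as an example.
    for tier in DOSAGE_RANGES_UG_PER_KG:
        low, high = total_dose_range_ug(70, tier)
        print(f"{tier}: {low} to {high} micrograms")
```

Working in integer micrograms avoids floating-point rounding when the body weight is a whole number of kilograms.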

The compound may be administered to an animal as frequently as
several times daily, or it may be administered less frequently, such as once a day, once
a week, once every two weeks, once a month, or even less frequently, such as once
every several months or even once a year or less. The frequency of the dose will be
readily apparent to the skilled artisan and will depend upon any number of factors, such
as, but not limited to, the type and severity of the disease being treated, the type and
age of the animal, etc. 

Methods 

The invention includes a method of stimulating snRNP assembly. The 
method comprises contacting an extract comprising snRNP components (e.g., SMN, 
SmB, SmD, snRNA, and the like) with SMN. This method takes advantage of the
novel discovery that SMN mediates snRNP assembly and is a component of the
complex involved in snRNP assembly. 

The invention includes a method of identifying a compound which 
affects the level of SMN expression in a cell. The method comprises contacting a cell 
with a test compound and comparing the level of SMN expression in the cell with the
level of SMN expression in an otherwise identical cell which is not contacted with the
compound. A higher or lower level of SMN expression in the cell treated with the
compound relative to the level of SMN expression in an untreated cell indicates that
the compound affects SMN expression. 
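
The comparison step of the screening method described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not a description of any assay actually used: the function name `classify_expression_effect`, the numeric expression values, and the fold-change threshold `min_fold_change` are all hypothetical. The specification refers only to a "higher or lower" level and does not recite a threshold.

```python
def classify_expression_effect(treated_level, untreated_level,
                               min_fold_change=1.5):
    """Compare SMN expression in a treated cell with an untreated control.

    treated_level / untreated_level: expression measurements in any
    consistent unit (e.g. a normalized band intensity).
    min_fold_change: ASSUMED cutoff for calling a difference; the
    specification itself does not recite a threshold.
    """
    if treated_level < 0 or untreated_level <= 0:
        raise ValueError("levels must be non-negative, control must be positive")
    if min_fold_change <= 1:
        raise ValueError("min_fold_change must be greater than 1")

    fold = treated_level / untreated_level
    if fold >= min_fold_change:
        return "increases"
    if fold <= 1 / min_fold_change:
        return "decreases"
    return "no clear effect"


if __name__ == "__main__":
    # Made-up measurements, for illustration only.
    print(classify_expression_effect(200, 100))
    print(classify_expression_effect(50, 100))
    print(classify_expression_effect(105, 100))
```

In this sketch, a compound classified as "increases" or "decreases" would correspond to a compound that affects SMN expression in the sense of the method.
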

One skilled in the art would appreciate, based upon the disclosure
provided herein, that such a compound would be useful to treat SMA since lack or
decreased expression of SMN in a cell is causally linked to SMA. Thus, a compound 
that affects SMN expression would be an important potential SMA therapeutic. 

The invention also includes a method of identifying compounds which
affect the level of SMN in a cell wherein the cell comprises an SMN modulating
sequence. The method comprises contacting the cell with a test compound and 
determining whether contact with the compound affects the expression of SMN in the 
cell compared with the level of SMN expression in an otherwise identical cell not 
contacted with the compound. Thus, the method allows identification of a compound
that affects SMN expression.

A compound that increases the level of expression of SMN compared to 
the level of SMN expression in the untreated cell is potentially useful for the treatment 
of SMA which is characterized by decreased levels of SMN in afflicted individuals. 
Thus, the invention also includes a method of identifying a compound useful for the
treatment of SMA. In one aspect, the method comprises contacting a cell comprising
an antisense nucleic acid complementary to a nucleic acid encoding SMN with a test 
compound. Then, the level of expression of SMN in the cell contacted with the 
compound is compared, using methods such as Northern blotting and the like, to the 
level of expression of SMN in an otherwise identical cell not treated with the
compound. A higher level of SMN expression in the cell contacted with the compound
compared with the level in the cell not contacted with the compound indicates that the
compound increased SMN expression in the cell. Since decreased SMN expression is
associated with SMA, compounds that increase SMN expression are important
potential therapeutics to treat the disease. Thus, a method of identifying a compound
that affects the level of SMN expression in a cell is an important tool in the
development of SMA therapeutics since the disease is associated with and apparently
mediated by a defect in the level of SMN expressed and/or the ability of the SMN 
produced in the cell to participate in mRNA biogenesis. 

One skilled in the art would appreciate, based upon the disclosure 
provided herein, that until the present invention, there was no cell model of SMA
available for in vitro testing of compounds or for studying the role of decreased or
absent SMN expression in the SMA disease process. This cell line, comprising an
SMN antisense nucleic acid and exhibiting decreased or absent SMN expression, is an
important tool for the study of SMA and for the identification of compounds useful for
treatment of the disease.

Further, the present invention includes a method of identifying a
compound useful for treatment of SMA wherein the cell comprises an SMN
modulating sequence that decreases the expression of endogenous SMN. The SMN
modulating sequence, i.e., a nucleic acid encoding SMN, further comprises a tag
polypeptide that renders the exogenous SMN fusion protein non-functional. Because
expression of the non-functional SMN inhibits expression of functional endogenous
SMN, the cell provides a useful system wherein potential SMA therapeutics can be
identified and evaluated. Thus, the invention includes a method wherein a cell
comprising an SMN modulating sequence which inhibits expression of endogenous
SMN is contacted with a test compound. The level of SMN expression in the cell
contacted with the compound is compared to the level of SMN expression in an
otherwise identical cell not contacted with the compound. A higher level of expression
of SMN in the cell contacted with the compound compared with the level of expression
of the otherwise identical but untreated cell is an indication that the compound is useful
for treatment of SMA.

One skilled in the art would also appreciate, based upon the disclosure 
provided herein, that a cell exhibiting decreased or absent expression of SMN would 
demonstrate altered growth characteristics due to the deleterious effect(s) of decreased 
levels of SMN expression. Therefore, another cell line was produced wherein the
expression of SMN can be controlled thereby allowing the cells to grow and divide so
as to produce sufficient cells to permit testing of potential SMN candidate therapeutics.
That is, once the cells reach sufficient numbers, the level of SMN can be selectively 
decreased or completely inhibited. 

Therefore, the invention includes a method of identifying a compound
useful for treating SMA comprising contacting a cell that has decreased or absent
levels of SMN expression thereby mimicking the disease condition of SMA. The cell
comprises a knock-out targeting vector such that at least one or both copies of the
SMN gene has/have been removed. The cell further comprises a copy of the SMN
gene present within the cell on a vector (e.g., a plasmid) under the control of an
inducible promoter (e.g., the tetracycline repressible promoter, and the like). The
promoter is induced to allow the cells to grow and divide in culture. The promoter is
then turned off such that SMN is not expressed in the cell. The cell then becomes a test
system which emulates the condition in the cells of patients afflicted with SMA. The
cell is then used for the method of the invention in that the cell is contacted with a test
compound. The ability of the compound to treat SMA can then be assessed by
determining the growth characteristics of the cell. That is, a compound that mediates
cell growth is a potential SMA therapeutic since the "block" in mRNA biosynthesis
due to decreased SMN expression has been overcome as a result of the cell having
been contacted with the compound. Further, the ability of the compound to increase
SMN expression can also be assessed using methods of detecting SMN mRNA such as
Northern blotting and RT-PCR assay, and the like.

In one embodiment, a chicken cell (i.e., DT40 which is a pre-B
lymphoid cell) was used. This cell was used because it has a high degree of
homologous recombination making it a useful cell line for use of a knock-out targeting
vector which replaced the nucleic acid sequence encoding SMN through homologous
recombination. However, the cell line need not exhibit a high degree of homologous
recombination; rather, the cell need only exhibit a degree of homologous 
recombination which allows successful use of a knock-out targeting vector which 
requires such recombination to remove all or part of the desired target SMN sequence. 



Likewise, although the DT40 cell comprises only a single SMN allele, the
present invention is not limited to using cells comprising a single SMN allele. Rather,
the invention includes using cells with at least one SMN allele for the production of 
knock-out cell lines. 

The invention also includes a method of identifying an RNA
splicing-modulating compound. As disclosed herein, splicing of RNA is diminished in the
absence of SMN. Thus, the method of identifying an RNA splicing-modulating
compound comprises incubating an in vitro pre-mRNA processing extract in the
absence of SMN and in the presence or absence of a test compound, and comparing the
level of splicing of RNA in the extract in the presence of the test compound with the
level of splicing of the RNA in the absence of the test compound, wherein a higher or a
lower level of RNA splicing in the extract in the presence of the test compound,
compared with the level of RNA splicing in the extract in the absence of the test
compound, is an indication that the test compound is an RNA splicing-modulating
compound. In one preferred embodiment, the RNA splicing modulation is
enhancement of RNA splicing. RNA splicing assays are disclosed herein as are
methods of assessing splicing and include Northern blotting to identify the various
processing species resulting from the various steps in pre-mRNA processing.

A compound that modulates RNA splicing is useful for increasing or
decreasing such reactions in vitro. Such a compound would be useful for treating any
disease or condition associated with or mediated by a decrease (e.g., SMA) or increase
in RNA splicing.

In one aspect, the invention includes a method of enhancing splicing of 
RNA. This method comprises incubating an in vitro pre-mRNA processing extract in 
the presence of SMN or a mutant or variant or a fragment thereof, thereby enhancing
splicing of the RNA. As stated previously herein, a compound that enhances RNA 
splicing would be useful for methods requiring such RNA splicing. Further, such a 
compound would be a useful candidate therapeutic for treatment of a disease or 
condition associated with or mediated by a decrease in RNA splicing. 



Further included in the invention is a method of assessing the presence
or degree of SMA in a mammal. This method is useful to diagnose SMA and is also
useful for assessing the progress of SMA or the efficacy of treatment in an SMA
patient. The method comprises obtaining a biopsy comprising motor neurons from the
mammal and assessing the number and morphology of gems in the motor neurons,
wherein a lower number of gems in the motor neurons, compared with the number of
gems in motor neurons obtained from an otherwise identical mammal which does not
have SMA, is an indication that the mammal has SMA, and further wherein the
absence of or the presence of a minimal number of gems in the mammal having SMA
is directly related to the severity of the SMA in the mammal. Such a method would be
useful in the diagnosis of SMA and in providing an assay for following up on the
efficacy of SMA treatments. That is, the number of gems present in the neurons of
SMA patients could be evaluated before, during and after treatment and the efficacy of
the treatment could thus be assessed. Further, such a method allows a determination of
the presence and/or the severity of the disease in a mammal.

Further, the invention includes a method of assessing the level of
binding of SMN obtained from a mammal to determine the presence or degree of
SMA in a mammal. The method takes advantage of the disclosure provided herein that
the SMN of SMA patients exhibits decreased binding with SMN-associated proteins
(e.g., SMN, SIP1, and Gemin3). Thus, the method compares the level of binding of
SMN obtained from a mammal (such as by obtaining SMN from cultured cells or a
biopsy obtained from the mammal) to an SMN-associated protein with the level of
binding of SMNwt with an identical SMN-associated protein. Such binding assay can
be performed in vitro using the methods disclosed herein (e.g., immobilized GST-SIP1
or GST-Gemin3 is contacted with the SMN obtained from the patient). The amount of
SMN specifically bound to the immobilized protein is determined using, for example,
antibodies that bind specifically to SMN, but other methods of assessing the amount of
bound SMN are included in the invention, such as using radiolabeled proteins to detect
the amount bound to the surface.



The invention includes a method of identifying a test compound that
affects binding of SMN with a protein that specifically binds with SMN (i.e., an
SMN-associated protein such as another SMN, SIP1, Gemin3, SmB, SmB', SmD1, SmD2,
and SmD3, and the like). One skilled in the art, based upon the disclosure provided
herein, would appreciate that the protein binding assays disclosed herein can be used to
compare the binding of a labeled SMN-associated protein to immobilized SMN as
described in the various embodiments exemplified herein. The method of
immobilizing the target protein and the identity of the SMN-associated protein or the
label present thereon are not crucial and any number of labels (e.g., tag polypeptide
epitope, radiolabels, and the like), surfaces (e.g., glutathione-Sepharose, nickel
columns), and SMN-associated proteins (e.g., SMN, or a mutant, variant or derivative
thereof, SIP1, and Gemin3) can all be employed in the method.

As discussed previously elsewhere herein, a compound that affects 
binding of SMN with a protein that specifically binds with SMN is an important
candidate SMA therapeutic since the binding of SMN to its associated proteins in SMA
patients has been demonstrated, by the data disclosed herein, to be affected. Thus, the 
instant method provides an important assay in the development of useful compounds 
for the treatment of SMA. 

The invention includes a method of assessing whether a test compound
is useful for treatment of SMA. In essence, the method identifies a test compound that
increases binding of SMN with a protein that specifically binds with SMN (i.e., an
SMN-associated protein such as another SMN, SIP1, Gemin3, SmB, SmB', SmD1,
SmD2, and SmD3, and the like). One skilled in the art, based upon the disclosure
provided herein, would appreciate that the protein binding assays disclosed herein can
be used to compare the binding of a labeled SMN-associated protein to immobilized
SMN as described in the various embodiments exemplified herein. Thus, the method
comprises making a preparation comprising the test compound, a labeled protein that
specifically binds with SMN (e.g., another SMN, SIP1, Gemin3, SmB, SmB', SmD1,
SmD2, and SmD3, and the like), and a surface comprising at least a portion of SMN
bound thereon. Preferably, the portion of SMN bound thereon comprises a portion of
SMN that specifically binds the protein that specifically binds SMN used in the
method.

One skilled in the art would appreciate, based upon the disclosure 
provided herein, that the method of immobilizing the target protein and the identity of 
the SMN-associated protein or the label present thereon are not crucial factors in the
assay such that any number of labels (e.g., tag polypeptide epitope, radiolabels, and the
like), surfaces (e.g., glutathione-Sepharose, nickel columns, immunoaffinity columns),
and SMN-associated proteins (e.g., SMN, or a mutant, variant or derivative thereof,
SIP1, and Gemin3) can all be employed in the method.

As discussed previously elsewhere herein, a compound that affects
binding of SMN with a protein that specifically binds with SMN is an important
candidate SMA therapeutic since the binding of SMN to its associated proteins in SMA
patients has been demonstrated, by the data disclosed herein, to be affected. Thus, the
instant method provides an important assay in the development of useful compounds
for the treatment of SMA.

The invention also includes a method of enhancing splicing of mRNA. The method comprises
incubating an in vitro pre-mRNA processing extract in the presence of SMN, or any
mutant, derivative, variant, and fragment thereof, thereby enhancing splicing of the
mRNA. Such extract is disclosed herein and/or it can be any crude nuclear cell extract
capable of pre-mRNA processing. Further, the substrate used in the method can be any
substrate that when processed yields a distinguishable processing product such that the 
activity of the extract can be assessed. Such substrates include the pre-mRNA 
processing substrates disclosed elsewhere herein as well as others well known in the 
art. 

The invention includes a method of identifying a compound that affects
pre-mRNA splicing. The method comprises incubating an extract capable of
pre-mRNA splicing, such as those disclosed elsewhere herein or other extracts well-known
in the art, in the presence or absence of a test compound and comparing the level of
pre-mRNA splicing in the extract in the absence and presence of the compound, where a higher
or lower level of pre-mRNA splicing in the treated extract is an indication that the test
compound affects pre-mRNA splicing.

The invention further includes a method of identifying a test compound 
that is useful to treat SMA. The method comprises incubating an extract capable of 
pre-mRNA splicing in the presence or absence of a test compound and comparing the
level of pre-mRNA splicing in the extract in the presence of the test compound with the
level of splicing of pre-mRNA in the absence of the compound. A higher level of
pre-mRNA splicing in the extract in the presence of the test compound compared with the
level of pre-mRNA splicing in the extract not comprising the test compound, is an
indication that the test compound is useful to treat SMA since decreased pre-mRNA
splicing is associated with SMA as disclosed, for the first time, herein. Thus, a
compound that increases pre-mRNA splicing, which is reduced in SMA, is a potential 
SMA therapeutic. 

Similarly, a compound that increases snRNP assembly, pre-mRNA
splicing, or SMN binding with an SMN-associated protein (e.g., another SMN, SIP1,
Gemin3, and the like) is useful for treatment of SMA in that each of these activities
is causally linked to SMA.

Thus, the invention includes a method of identifying a compound that 
affects snRNP assembly. The method comprises incubating an extract capable of
snRNP assembly in the presence or absence of a test compound and comparing the
level of snRNP assembly in the extract comprising the test compound to the level of
snRNP assembly in the extract which does not contain the compound. A higher or
lower level of snRNP assembly in the extract containing the compound compared with the extract
not containing the compound is an indication that the test compound affects snRNP
assembly.

The invention further includes a method of identifying a test compound 
that is useful to treat SMA. The method comprises incubating an extract capable of 
snRNP assembly in the presence or absence of a test compound and comparing the 
level of snRNP assembly in the extract containing the test compound with the level of
snRNP assembly in the extract without the compound. A higher level of snRNP
assembly in the extract containing the test compound compared with the level of
snRNP assembly in the extract without it is an indication that the test compound is 
useful to treat SMA. 

The invention includes a method of assessing the presence or degree of
SMA in a mammal. The method comprises comparing the level of binding of SMN
obtained from the mammal to a protein that specifically binds with SMN with the level
of binding of SMN wild type to an identical protein that specifically binds with SMN,
wherein a lower level of binding of the SMN from the mammal to the protein that
specifically binds with SMN compared with the level of binding of SMN wild type
with the identical protein that specifically binds with SMN is an indication of the
presence or degree of SMA in a mammal. This is because, as the data disclosed herein
demonstrate, the SMN of SMA patients exhibits decreased binding to itself, and to
other SMN-associated proteins, compared with wild type SMN.

The invention further includes a method of identifying a compound that
affects SMN expression in a cell. The method comprises contacting a cell as
described herein with a test compound and comparing the level of SMN expression in
the cell with the level of SMN expression in an otherwise identical cell which is not
contacted with the test compound. A higher or lower level of SMN expression in the
cell contacted with the test compound compared with the level of SMN expression in
the otherwise identical cell which is not contacted with the compound is an indication
that the test compound affects SMN expression in a cell, thereby identifying a
compound that affects SMN expression in a cell.
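
By way of illustration only, the comparison of SMN expression between a treated cell and an otherwise identical untreated cell can be sketched in Python as follows. The function name `classify_expression_change`, the `min_fold` parameter, and its default cutoff of 1.5 are hypothetical assumptions introduced for this sketch; the disclosure itself specifies only that a higher or lower level is indicative and does not state any numerical threshold.

```python
def classify_expression_change(treated, untreated, min_fold=1.5):
    """Classify the effect of a test compound on SMN expression.

    treated / untreated are expression levels (e.g., normalized band
    intensities) for the contacted cell and the otherwise identical
    cell not contacted with the compound.
    min_fold is an assumed, illustrative cutoff; it is not taken from
    the disclosure.
    """
    if untreated <= 0 or treated < 0:
        raise ValueError("expression levels must be positive (untreated) and non-negative (treated)")
    if min_fold <= 1:
        raise ValueError("min_fold must be greater than 1")
    fold = treated / untreated
    if fold >= min_fold:
        return "increases"   # candidate SMA therapeutic under the method
    if fold <= 1 / min_fold:
        return "decreases"   # affects expression, but lowers it
    return "no effect"       # change is within the assumed tolerance
```

Under these assumptions, a compound would be flagged as affecting SMN expression when the result is either "increases" or "decreases", and as potentially useful to treat SMA only when the result is "increases".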

Also included in the invention is a method of identifying a compound 
that is useful to treat SMA. This method comprises contacting a cell as described
herein with a test compound and comparing the level of SMN expression in the cell
with the level of SMN expression in an otherwise identical cell which is not contacted
with the test compound. A higher level of SMN expression in the cell contacted with
the test compound compared with the level of SMN expression in the otherwise identical
cell which is not contacted with the test compound is an indication that the test
compound increases SMN expression in a cell, thereby identifying a compound that is
useful to treat SMA. 

In addition, there is included in the invention a method of identifying a 
compound useful for the treatment of SMA. This method comprises contacting a cell
as described herein with a test compound and comparing the level of growth of the cell
with the level of growth of an otherwise identical cell which is not contacted with the
test compound. A higher level of growth of the cell contacted with the test compound
compared with the level of growth of the cell which is not contacted with the
compound is an indication that the compound is useful to treat SMA.

Definitions

As used herein, each of the following terms has the meaning associated 
with it in this section. 

The articles "a" and "an" are used herein to refer to one or to more than 
one (i.e., to at least one) of the grammatical object of the article. By way of example, 
"an element" means one element or more than one element. 

As used herein, the term "adjacent" is used to refer to nucleotide 
sequences which are directly attached to one another, having no intervening 
nucleotides. By way of example, the pentanucleotide 5'-AAAAA-3' is adjacent the 
trinucleotide 5'-TTT-3' when the two are connected thus: 5'-AAAAATTT-3' or
5'-TTTAAAAA-3', but not when the two are connected thus: 5'-AAAAACTTT-3'.

As used herein, amino acids are represented by the full name thereof, by the three-letter
code corresponding thereto, or by the one-letter code corresponding thereto, as
indicated in the following table:



Full Name          Three-Letter Code    One-Letter Code
Aspartic Acid      Asp                  D
Glutamic Acid      Glu                  E
Lysine             Lys                  K
Arginine           Arg                  R
Histidine          His                  H
Tyrosine           Tyr                  Y
Cysteine           Cys                  C
Asparagine         Asn                  N
Glutamine          Gln                  Q
Serine             Ser                  S
Threonine          Thr                  T
Glycine            Gly                  G
Alanine            Ala                  A
Valine             Val                  V
Leucine            Leu                  L
Isoleucine         Ile                  I
Methionine         Met                  M
Proline            Pro                  P
Phenylalanine      Phe                  F
Tryptophan         Trp                  W

As used herein, to "alleviate" SMA means reducing the severity of the 
symptoms of the disease or disorder. This includes, but is not limited to, increasing the 
level of binding of SMN with SIP1, Gemin3, and/or another SMN in a patient afflicted
with SMA compared with the level of binding of SMN to these proteins in the patient
prior to or in the absence of the method of treatment.

By the term "altered growth characteristics," as the term is used herein, 
is meant any variation in growth exhibited by a cell compared with an otherwise 
identical cell into which the SMN modulating sequence has not been introduced. 

"Antisense" refers particularly to the nucleic acid sequence of the
non-coding strand of a double stranded DNA molecule encoding a protein, or to a sequence
which is substantially homologous to the non-coding strand. As defined herein, an
antisense sequence is complementary to the sequence of a double stranded DNA
molecule encoding a protein. It is not necessary that the antisense sequence be
complementary solely to the coding portion of the coding strand of the DNA molecule.
The antisense sequence may be complementary to regulatory sequences specified on
the coding strand of a DNA molecule encoding a protein, which regulatory sequences
control expression of the coding sequences. 

By "biological activity," as the term is used herein, is meant that the 
protein has the ability to interact with its associated protein(s) and effectuate its normal 
function(s) within the cell. In a preferred embodiment, the SMN linked to BSA retains
its biological activity in that the protein retained its ability to bind avidly to SIP1, as
well as the ability to bind to the Sm proteins, and mediate formation of the Sm core
domain and to participate in its role in mRNA biogenesis. Further, biological activity
as it refers to any form or fragment of SMN, SIP1, and Gemin3 means that the protein
has the ability to bind to SMN, SIP1, and/or Gemin3 and/or to participate in mRNA
biogenesis. 

By "complementary to a portion or all of the nucleic acid encoding 
SMN" is meant a sequence of nucleic acid which does not encode SMN protein. 
Rather, the sequence which is being expressed in the cells is identical to the non-coding
strand of the nucleic acid encoding SMN and thus, does not encode SMN protein.

The terms "complementary" and "antisense" as used herein, are not
entirely synonymous. "Antisense" refers particularly to the nucleic acid sequence of
the non-coding strand of a double stranded DNA molecule encoding a protein, or to a
sequence which is substantially homologous to the non-coding strand.
"Complementary" as used herein refers to the broad concept of subunit sequence
complementarity between two nucleic acids, e.g., two DNA molecules. When a
nucleotide position in both of the molecules is occupied by nucleotides normally
capable of base pairing with each other, then the nucleic acids are considered to be
complementary to each other at this position. Thus, two nucleic acids are
complementary to each other when a substantial number (at least 50%) of
corresponding positions in each of the molecules are occupied by nucleotides which
normally base pair with each other (e.g., A:T and G:C nucleotide pairs). As defined
herein, an antisense sequence is complementary to the sequence of a double stranded
DNA molecule encoding a protein. It is not necessary that the antisense sequence be
complementary solely to the coding portion of the coding strand of the DNA molecule.
The antisense sequence may be complementary to regulatory sequences specified on
the coding strand of a DNA molecule encoding a protein, which regulatory sequences
control expression of the coding sequences.
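
By way of illustration only, the "at least 50%" criterion for complementarity set forth above can be sketched in Python as follows. The names `BASE_PAIRS`, `fraction_complementary`, and `are_complementary` are hypothetical and introduced solely for this sketch. The sketch assumes that the two sequences are of equal length and are already aligned position by position (the first written 5' to 3', the second written 3' to 5'), and it adds A:U pairs for RNA as a further assumption.

```python
# Base pairs treated as "normally capable of base pairing".
# A:T and G:C come from the definition; A:U is an added assumption for RNA.
BASE_PAIRS = {
    ("A", "T"), ("T", "A"),
    ("G", "C"), ("C", "G"),
    ("A", "U"), ("U", "A"),
}


def fraction_complementary(first, second):
    """Return the fraction of aligned positions that can base pair.

    first is written 5'->3' and second is written 3'->5', so that
    position i of one strand faces position i of the other.
    """
    if not first or len(first) != len(second):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = zip(first.upper(), second.upper())
    matches = sum((a, b) in BASE_PAIRS for a, b in pairs)
    return matches / len(first)


def are_complementary(first, second, threshold=0.5):
    """Apply the 'at least 50% of corresponding positions' criterion."""
    return fraction_complementary(first, second) >= threshold
```

For example, under these assumptions the sequences AAAA and TTGG pair at two of four positions and therefore just meet the 50% criterion, whereas AAAA and TGGG pair at only one of four positions and do not.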

A "coding region" of a gene consists of the nucleotide residues of the
coding strand of the gene and the nucleotides of the non-coding strand of the gene
which are homologous with or complementary to, respectively, the coding region of an
mRNA molecule which is produced by transcription of the gene. 

A "coding region" of an mRNA molecule also consists of the nucleotide 
residues of the mRNA molecule which are matched with an anticodon region of a
transfer RNA molecule during translation of the mRNA molecule or which encode a
stop codon. The coding region may thus include nucleotide residues corresponding to
amino acid residues which are not present in the mature protein encoded by the mRNA
molecule (e.g., amino acid residues in a protein export signal sequence).

"Encoding" refers to the inherent property of specific sequences of
nucleotides in a polynucleotide, such as a gene, a cDNA, or an mRNA, to serve as
templates for synthesis of other polymers and macromolecules in biological processes
having either a defined sequence of nucleotides (i.e., rRNA, tRNA and mRNA) or a
defined sequence of amino acids and the biological properties resulting therefrom.
Thus, a gene encodes a protein if transcription and translation of mRNA corresponding
to that gene produces the protein in a cell or other biological system. Both the coding
strand, the nucleotide sequence of which is identical to the mRNA sequence and is
usually provided in sequence listings, and the non-coding strand, used as the template 
for transcription of a gene or cDNA, can be referred to as encoding the protein or other 
product of that gene or cDNA. 

Unless otherwise specified, a "nucleotide sequence encoding an amino
acid sequence" includes all nucleotide sequences that are degenerate versions of each
other and that encode the same amino acid sequence. Nucleotide sequences that 
encode proteins and RNA may include introns. 

"Expression vector" refers to a vector comprising a recombinant
polynucleotide comprising expression control sequences operatively linked to a
nucleotide sequence to be expressed. An expression vector comprises sufficient
cis-acting elements for expression; other elements for expression can be supplied by the
host cell or in an in vitro expression system. Expression vectors include all those
known in the art, such as cosmids, plasmids (e.g., naked or contained in liposomes) and
viruses (e.g., retroviruses, adenoviruses, and adeno-associated viruses) that incorporate
the recombinant polynucleotide. 

A first region of an oligonucleotide "flanks" a second region of the 
oligonucleotide if the two regions are adjacent one another or if the two regions are 
separated by no more than about 1000 nucleotide residues, and preferably no more than
about 100 nucleotide residues.
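
By way of illustration only, the definition of "flanks" set forth above can be sketched in Python as follows. The function name `flanks`, the representation of each region as a half-open `(start, end)` interval on the same oligonucleotide, and the treatment of "about 1000" as an exact cutoff are all assumptions made for this sketch and are not part of the definition.

```python
def flanks(first, second, max_gap=1000):
    """Return True if two regions of one oligonucleotide flank each other.

    Each region is a (start, end) pair of 0-based positions, end exclusive.
    Regions flank one another when they are adjacent (gap of 0) or are
    separated by no more than max_gap nucleotide residues. The default of
    1000 mirrors the definition; pass max_gap=100 for the preferred limit.
    """
    (start_a, end_a), (start_b, end_b) = sorted([first, second])
    if start_b < end_a:
        raise ValueError("regions overlap")
    gap = start_b - end_a  # number of intervening nucleotide residues
    return gap <= max_gap
```

Under these assumptions, `flanks((0, 10), (10, 20))` describes two adjacent regions, and `flanks((0, 10), (200, 210), max_gap=100)` applies the preferred limit of about 100 residues to regions separated by 190 residues.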

As used herein, the term "fragment" as applied to a nucleic acid, may
ordinarily be at least about 20 nucleotides in length, typically, at least about 100
nucleotides, more typically, from about 100 to about 500 nucleotides, typically at least
about forty contiguous amino acids, preferably at least about 500 to about 1,000
nucleotides, even more preferably at least about 1,000 nucleotides to about 2,000
nucleotides, yet even more preferably at least about 2,000 to about 3,500, and most
preferably, the nucleic acid fragment will be greater than about 3,500 nucleotides in
length.

As used herein, the term "fragment" as applied to a polypeptide, may 
ordinarily be at least about seven contiguous amino acids, typically, at least about
fifteen contiguous amino acids, more typically, at least about thirty contiguous amino
acids, typically at least about forty contiguous amino acids, preferably at least about
fifty amino acids, even more preferably at least about sixty amino acids and most
preferably, the peptide fragment will be greater than about sixty contiguous amino
acids in length.

As applied to a protein, a "fragment" of SIP1 is about 50 amino acids in
length. More preferably, the fragment of SIP1 is about 100 amino acids, even more
preferably, at least about 200, yet more preferably, at least about 300, even more
preferably, at least about 400, yet more preferably, at least about 500, and more
preferably, at least about 600 amino acids in length.

A "genomic DNA" is a DNA strand which has a nucleotide sequence
homologous with a gene. By way of example, both a fragment of a chromosome and a
cDNA derived by reverse transcription of a mammalian mRNA are genomic DNAs.

"Homologous" as used herein, refers to the subunit sequence similarity between two
polymeric molecules, e.g., between two nucleic acid molecules, e.g., two DNA
molecules or two RNA molecules, or between two polypeptide molecules. When a
subunit position in both of the two molecules is occupied by the same monomeric
subunit, e.g., if a position in each of two DNA molecules is occupied by adenine, then
they are homologous at that position. The homology between two sequences is a direct
function of the number of matching or homologous positions, e.g., if half (e.g., five
positions in a polymer ten subunits in length) of the positions in two compound
sequences are homologous then the two sequences are 50% homologous; if 90% of the
positions, e.g., 9 of 10, are matched or homologous, the two sequences share 90%
homology. By way of example, the DNA sequences 3'ATTGCC5' and 3'TATGGC5'
share 50% homology.
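
By way of illustration only, the position-by-position calculation of percent homology described above can be sketched in Python as follows. The function name `percent_homology` is hypothetical, and the sketch assumes two sequences of equal length that are compared without gaps, as in the example given in the definition; it is not a substitute for an alignment algorithm such as BLAST.

```python
def percent_homology(seq_a, seq_b):
    """Return the percentage of positions occupied by the same subunit.

    Both sequences must be written in the same orientation and have the
    same length; no gaps or alignment are considered in this sketch.
    """
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# The example from the definition: ATTGCC and TATGGC match at
# positions 3, 4 and 6 (T, G and C), i.e., at 3 of 6 positions.
example = percent_homology("ATTGCC", "TATGGC")
```

Here `example` evaluates to 50.0, in agreement with the 50% homology stated for these two sequences.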

As used herein, "homology" is used synonymously with "identity." 
In addition, when the term "homology" is used herein to refer to the nucleic acids and 
proteins, it should be construed to be applied to homology at both the nucleic acid and 
the amino acid levels. 

Percent identity of one polynucleotide or polypeptide with respect to
another polynucleotide or polypeptide may be determined using any available
algorithm, such as the BLAST program as described in Altschul et al. (1990, J. Mol.
Biol. 215:403-410).

An "isolated nucleic acid" refers to a nucleic acid segment or fragment
which has been separated from sequences which flank it in a naturally occurring state,
e.g., a DNA fragment which has been removed from the sequences which are normally
adjacent to the fragment, e.g., the sequences adjacent to the fragment in a genome in
which it naturally occurs. The term also applies to nucleic acids which have been
substantially purified from other components which naturally accompany the nucleic
acid, e.g., RNA or DNA or proteins, which naturally accompany it in the cell. The
term therefore includes, for example, a recombinant DNA which is incorporated into a
vector, into an autonomously replicating plasmid or virus, or into the genomic DNA of
a prokaryote or eukaryote, or which exists as a separate molecule (e.g., as a cDNA or a
genomic or cDNA fragment produced by PCR or restriction enzyme digestion)
independent of other sequences. It also includes a recombinant DNA which is part of a
hybrid gene encoding additional polypeptide sequence.

In the context of the present invention, the following abbreviations for 
the commonly occurring nucleic acid bases are used. "A" refers to adenosine, "C" 
refers to cytidine, "G" refers to guanosine, "T" refers to thymidine, and "U" refers to 
uridine.

"Mutants," "derivatives," and "variants" of the peptides of the invention 
(or of the DNA encoding the same) are peptides which may be altered in one or more 
amino acids (or in one or more base pairs) such that the peptide (or DNA) is not 
identical to the sequences recited herein, but has the same property as the peptides
disclosed herein, in that the peptide has the property of binding to SMN, SIP1, Sm
proteins and/or functioning in mRNA biogenesis.

By describing two polynucleotides as "operably linked" is meant that a
single-stranded or double-stranded nucleic acid moiety comprises the two
polynucleotides arranged within the nucleic acid moiety in such a manner that at least
one of the two polynucleotides is able to exert a physiological effect by which it is
characterized upon the other. By way of example, a promoter operably linked to the
coding region of a gene is able to promote transcription of the coding region.
Preferably, when the nucleic acid encoding the desired protein further comprises a
promoter/regulatory sequence, the promoter/regulatory sequence is positioned at the 5'
end of the desired protein coding sequence such that it drives expression of the desired
protein in a cell. Together, the nucleic acid encoding the desired protein and its
promoter/regulatory sequence comprise a "transgene."

As used herein, the term "pharmaceutically acceptable carrier" means a
chemical composition with which the active ingredient may be combined and which,
following the combination, can be used to administer the active ingredient to a subject.

As used herein, the term "physiologically acceptable" ester or salt 
means an ester or salt form of the active ingredient which is compatible with any other 
ingredients of the pharmaceutical composition, which is not deleterious to the subject 
to which the composition is to be administered. 

As used herein, the term "promoter/regulatory sequence" means a 
nucleic acid sequence which is required for expression of a gene product operably 
linked to the promoter/regulatory sequence. In some instances, this sequence may be 
the core promoter sequence and in other instances, this sequence may also include an 
enhancer sequence and other regulatory elements which are required for expression of 
the gene product. The promoter/regulatory sequence may, for example, be one which 
expresses the gene product in a tissue specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when 
operably linked with a polynucleotide which encodes or specifies a gene product, 
causes the gene product to be produced in a cell under most or all physiological 
conditions of the cell. 

An "inducible" promoter is a nucleotide sequence which, when operably 
linked with a polynucleotide which encodes or specifies a gene product, causes the 
gene product to be produced in a cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 

A "tissue-specific" promoter is a nucleotide sequence which, when 
operably linked with a polynucleotide which encodes or specifies a gene product, 
causes the gene product to be produced in a cell substantially only if the cell is a cell of 
the tissue type corresponding to the promoter. 

The term "expression of a nucleic acid" as used herein means the
synthesis of the protein product encoded by the nucleic acid.

The use of the term "DNA encoding" should be construed to include the
DNA sequence which encodes the desired protein and any necessary 5' or 3'
untranslated regions accompanying the actual coding sequence.

By the term "positioned at the 5' end" as used herein, is meant that the
promoter/regulatory sequence is covalently bound to the 5' end of the nucleic acid
whose expression it regulates, at a position sufficiently close to the 5' start site of
transcription of the nucleic acid so as to drive expression thereof.

The direction of 5' to 3' addition of nucleotides to nascent RNA 
transcripts is referred to as the transcription direction. The DNA strand having the 
same sequence as an mRNA is referred to as the "coding strand"; sequences on the 
DNA strand which are located 5' to a reference point on the DNA are referred to as
"upstream sequences"; sequences on the DNA strand which are 3' to a reference point 
on the DNA are referred to as "downstream sequences." 

A "portion" of a polynucleotide means at least about twenty
sequential nucleotide residues of the polynucleotide. It is understood that a portion of a 
polynucleotide may include every nucleotide residue of the polynucleotide. 

A "polyadenylation sequence" is a polynucleotide sequence which 
directs the addition of a poly A tail onto a transcribed messenger RNA sequence. 

A "polynucleotide" means a single strand or parallel and anti-parallel 
strands of a nucleic acid. Thus, a polynucleotide may be either a single-stranded or a 
double-stranded nucleic acid. 

The term "nucleic acid" typically refers to large polynucleotides.
The term "oligonucleotide" typically refers to short polynucleotides, generally, no
greater than about 50 nucleotides. It will be understood that when a nucleotide
sequence is represented by a DNA sequence (i.e., A, T, G, C), this also includes an
RNA sequence (i.e., A, U, G, C) in which "U" replaces "T."
Conventional notation is used herein to describe polynucleotide sequences: the
left-hand end of a single-stranded polynucleotide sequence is the 5'-end; the left-hand
direction of a double-stranded polynucleotide sequence is referred to as the
5'-direction.
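
The notation convention above, in which a sequence written with DNA letters is also understood to denote the corresponding RNA sequence, may be illustrated by the following minimal Python sketch. The function name `dna_to_rna` is hypothetical; the sketch only rewrites the letters of a sequence written 5' to 3' and makes no statement about strand selection or transcription in a cell.

```python
def dna_to_rna(dna_sequence: str) -> str:
    """Return the RNA-notation form of a DNA-notation sequence (U replaces T)."""
    sequence = dna_sequence.upper()
    # Reject anything other than the four DNA base abbreviations.
    unexpected = set(sequence) - set("ATGC")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    return sequence.replace("T", "U")


# The left-hand end remains the 5'-end in both notations.
print(dna_to_rna("ATTGCC"))  # prints AUUGCC
```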

"Primer" refers to a polynucleotide that is capable of specifically 
hybridizing to a designated polynucleotide template and providing a point of initiation 
for synthesis of a complementary polynucleotide. Such synthesis occurs when the 
polynucleotide primer is placed under conditions in which synthesis is induced, i.e., in 
the presence of nucleotides, a complementary polynucleotide template, and an agent
for polymerization such as DNA polymerase. A primer is typically single-stranded,
but may be double-stranded. Primers are typically deoxyribonucleic acids, but a wide
variety of synthetic and naturally occurring primers are useful for many applications.
A primer is complementary to the template to which it is designed to hybridize to serve
as a site for the initiation of synthesis, but need not reflect the exact sequence of the
template. In such a case, specific hybridization of the primer to the template depends
on the stringency of the hybridization conditions. Primers can be labeled with, e.g.,
chromogenic, radioactive, or fluorescent moieties and used as detectable moieties.

"Probe" refers to a polynucleotide that is capable of specifically
hybridizing to a designated sequence of another polynucleotide. A probe specifically
hybridizes to a target complementary polynucleotide, but need not reflect the exact
complementary sequence of the template. In such a case, specific hybridization of the
probe to the target depends on the stringency of the hybridization conditions. Probes
can be labeled with, e.g., chromogenic, radioactive, or fluorescent moieties and used as
detectable moieties.

By the term "promoter/regulatory sequence" is meant a DNA sequence
which is required for expression of a nucleic acid operably linked to the
promoter/regulatory sequence. In some instances, the promoter/regulatory sequence
may function in a tissue specific manner, in that, the promoter/regulatory sequence is
only capable of driving expression in a cell of a particular tissue type. In some
instances, this sequence may be the core promoter sequence and in other instances, this
sequence may also include an enhancer sequence and other regulatory elements which
are required for expression in a tissue-specific manner.

"Recombinant polynucleotide" refers to a polynucleotide having
sequences that are not naturally joined together. An amplified or assembled
recombinant polynucleotide may be included in a suitable vector, and the vector can be
used to transform a suitable host cell.

A recombinant polynucleotide may serve a non-coding function (e.g., 
promoter, origin of replication, ribosome-binding site, etc.) as well. 



A "recombinant polypeptide" is one which is produced upon expression 
of a recombinant polynucleotide. 

"Polypeptide" refers to a polymer composed of amino acid residues, 
related naturally occurring structural variants, and synthetic non-naturally occurring 
analogs thereof linked via peptide bonds, related naturally occurring structural variants, 
and synthetic non-naturally occurring analogs thereof. Synthetic polypeptides can be 
synthesized, for example, using an automated polypeptide synthesizer. 

The term "protein" typically refers to large polypeptides. 

The term "peptide" typically refers to short polypeptides. 

Conventional notation is used herein to portray polypeptide sequences: 
the left-hand end of a polypeptide sequence is the amino-terminus; the right-hand end 
of a polypeptide sequence is the carboxyl-terminus. 

As used herein, the term "reporter gene" means a gene, the expression of 
which can be detected using a known method. By way of example, the Escherichia 
coli lacZ gene may be used as a reporter gene in a medium because expression of the 
lacZ gene can be detected using known methods by adding the chromogenic substrate 
o-nitrophenyl-β-galactoside to the medium (Gerhardt et al., eds., 1994, Methods for
General and Molecular Bacteriology, American Society for Microbiology, Washington, 
DC, p. 574). 

A "restriction site" is a portion of a double-stranded nucleic acid which 
is recognized by a restriction endonuclease. 

A portion of a double-stranded nucleic acid is "recognized" by a 
restriction endonuclease if the endonuclease is capable of cleaving both strands of the 
nucleic acid at the portion when the nucleic acid and the endonuclease are contacted. 
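
For illustration, locating a restriction site within one strand of a double-stranded nucleic acid may be sketched in Python as follows. The sketch assumes, as an example not taken from the disclosure, the well-known EcoRI recognition sequence GAATTC; the function name `find_restriction_sites` and the test sequence are hypothetical. Because GAATTC is palindromic, the same site reads identically 5' to 3' on the complementary strand.

```python
def find_restriction_sites(sequence: str, site: str = "GAATTC") -> list[int]:
    """Return the 0-based start positions of every occurrence of `site`.

    Assumption: `sequence` is one strand written 5' to 3'; the default site
    is the EcoRI recognition sequence, used here only as an example.
    """
    seq = sequence.upper()
    target = site.upper()
    positions = []
    start = seq.find(target)
    while start != -1:
        positions.append(start)
        # Continue searching one position further along the strand.
        start = seq.find(target, start + 1)
    return positions


# Hypothetical sequence containing two EcoRI sites.
print(find_restriction_sites("CCGAATTCGGATGAATTCAA"))  # prints [2, 12]
```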

By the term "specifically binds," as used herein, is meant a compound, 
e.g., a protein, a nucleic acid, an antibody, and the like, which recognizes and binds a 
specific molecule, but does not substantially recognize or bind other molecules in a 
sample. 

A first oligonucleotide anneals with a second oligonucleotide "with high
stringency" if the two oligonucleotides anneal under conditions whereby only
oligonucleotides which are at least about 75%, and preferably at least about 90% or at
least about 95%, complementary anneal with one another. The stringency of
conditions used to anneal two oligonucleotides is a function of, among other factors,
temperature, ionic strength of the annealing medium, the incubation period, the length
of the oligonucleotides, the G-C content of the oligonucleotides, and the expected
degree of non-homology between the two oligonucleotides, if known. Methods of
adjusting the stringency of annealing conditions are known (see, e.g., Sambrook et al.,
1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory,
New York).
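
The degree of complementarity referred to above may be computed for two oligonucleotides as in the following Python sketch. The function names are hypothetical, and the sketch assumes two DNA oligonucleotides of equal length, each written 5' to 3', that pair in antiparallel orientation without gaps. It measures complementarity only; it does not model temperature, ionic strength, or any other annealing condition.

```python
_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def percent_complementary(oligo_a: str, oligo_b: str) -> float:
    """Percent of positions at which two antiparallel oligonucleotides base-pair."""
    a, b = oligo_a.upper(), oligo_b.upper()
    if len(a) != len(b) or not a:
        raise ValueError("oligonucleotides must be non-empty and of equal length")
    # Read the second oligonucleotide 3' to 5' so it lines up antiparallel with the first.
    b_reversed = b[::-1]
    paired = sum(1 for x, y in zip(a, b_reversed) if _COMPLEMENT.get(x) == y)
    return 100.0 * paired / len(a)


def meets_complementarity_floor(oligo_a: str, oligo_b: str, floor: float = 75.0) -> bool:
    """True if the pair is at least `floor` percent complementary (75% by default)."""
    return percent_complementary(oligo_a, oligo_b) >= floor


# Hypothetical 10-mers: the second differs from the perfect complement at one position.
print(percent_complementary("AAAAGGGGCC", "GGCCCCTTTA"))  # prints 90.0
```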

By the term "SMN modulating sequence," as used herein, is meant a
nucleic acid sequence which when introduced into a cell affects the level of expression
of SMN protein. Such altered level of SMN expression may, but the invention does
not require, cause the cell to exhibit altered growth characteristics or altered formation
of snRNP complexes compared with an otherwise identical cell into which the nucleic
acid sequence has not been introduced. The nucleic acid sequence may be a DNA
molecule in either a sense or antisense orientation with respect to the SMN coding
sequence or the nucleic acid sequence may be, inter alia, a ribozyme specific for
human SMN.

As used herein, the term "substantially pure" describes a compound,
e.g., a nucleic acid, protein or polypeptide, which has been separated from components
which naturally accompany it. Typically, a compound is substantially pure when at
least about 10%, preferably at least about 20%, more preferably at least about 50%,
still more preferably at least about 75%, even more preferably at least about 90%, and
most preferably at least about 99% of the total material (by volume, by wet or dry
weight, or by mole percent or mole fraction) in a sample is the compound of interest.
Purity can be measured by any appropriate method, e.g., by column chromatography,
gel electrophoresis or HPLC analysis.
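
The graded purity levels recited above may be applied to a measured sample as in the following Python sketch, which is offered only as an illustration. The function name `purity_tier` and the tier labels are hypothetical, and the sketch treats each "at least about" figure as an exact floor, which is a simplifying assumption.

```python
# Floors (percent of total material) taken from the definition above, highest first.
PURITY_TIERS = [
    (99.0, "most preferably"),
    (90.0, "even more preferably"),
    (75.0, "still more preferably"),
    (50.0, "more preferably"),
    (20.0, "preferably"),
    (10.0, "typically"),
]


def purity_tier(compound_amount: float, total_amount: float):
    """Return the highest tier met by a sample, or None if below every floor.

    Both amounts must be in the same unit (e.g., dry weight or moles).
    """
    if total_amount <= 0 or compound_amount < 0 or compound_amount > total_amount:
        raise ValueError("amounts must satisfy 0 <= compound <= total and total > 0")
    percent = 100.0 * compound_amount / total_amount
    for floor, label in PURITY_TIERS:
        if percent >= floor:
            return label
    return None


# Hypothetical sample: 92 mg of the compound of interest in 100 mg total dry weight.
print(purity_tier(92, 100))  # prints even more preferably
```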

A compound, e.g., a nucleic acid, a protein or polypeptide is also
"substantially purified" when it is essentially free of naturally associated components
or when it is separated from the native contaminants which accompany it in its natural
state. Thus, a "substantially pure" preparation of a nucleic acid, as used herein, refers
to a nucleic acid sequence which has been purified from the sequences which flank it in
a naturally occurring state, e.g., a DNA fragment which has been removed from the
sequences which are normally adjacent to the fragment in a genome in which it
naturally occurs.

Similarly, a "substantially pure" preparation of a protein or a
polypeptide, as used herein, refers to a protein or polypeptide which has been purified
from components with which it is normally associated in its naturally occurring state.
A substantially pure peptide can be purified by following known procedures for protein
purification, wherein an immunological, enzymatic or other assay is used to monitor
purification at each stage in the procedure. Protein purification methods are well
known in the art, and are described, for example, in Deutscher et al. (1990, In: Guide to
Protein Purification, Harcourt Brace Jovanovich, San Diego).

As used herein, the term "transgene" means an exogenous nucleic acid
sequence which exogenous nucleic acid is encoded by a transgenic cell or mammal.

By the term "exogenous nucleic acid" is meant that the nucleic acid has 
been introduced into a cell or an animal using technology which has been developed 
for the purpose of facilitating the introduction of a nucleic acid into a cell or an animal. 

A cell that comprises a transgene is referred to as a "recombinant cell."
Such a cell may be a eukaryotic cell or a prokaryotic cell. A gene which is expressed
in a recombinant cell wherein the gene comprises a recombinant polynucleotide,
produces a "recombinant polypeptide."

By the term "SMN-associated protein" as used herein, is meant a
protein that specifically binds with SMN. Such proteins include another SMN, an
SIP1, a Gemin3, and various Sm proteins (e.g., SmB, SmB', SmD1, SmD2, and
SmD3).

By "tag" polypeptide is meant any protein which, when linked by a
peptide bond to a protein of interest, may be used to localize the protein, to purify it
from a cell extract, to immobilize it for use in binding assays, or to otherwise study its
biological properties and/or function. A chimeric (i.e., fusion) protein containing a
"tag" epitope can be immobilized on a resin which binds the tag. Such tag epitopes and
resins which specifically bind them are well known in the art and include, for example,
tag epitopes comprising a plurality of sequential histidine residues (His6), which
allows isolation of a chimeric protein comprising such an epitope on
nickel-nitrilotriacetic acid-agarose, a hemagglutinin (HA) tag epitope allowing a chimeric
protein comprising such an epitope to bind with an anti-HA-monoclonal antibody
affinity matrix, a myc tag epitope allowing a chimeric protein comprising such an
epitope to bind with an anti-myc-monoclonal antibody affinity matrix, a
glutathione-S-transferase tag epitope, and a maltose binding protein (MBP) tag epitope, which can
induce binding between a protein comprising such an epitope and a glutathione- or
maltose-Sepharose column, respectively. Production of proteins comprising such tag
epitopes is well known in the art and is described in standard treatises such as
Sambrook et al., 1989, and Ausubel et al., supra. Likewise, antibodies to the tag
epitope (e.g., anti-HA, anti-myc antibody 9E10, and the like) allow detection and
localization of the fusion protein in, for example, Western blots, ELISA assays, and
immunostaining of cells.

As used herein, to "treat" means reducing the frequency with which 
symptoms of SMA are experienced by a patient. 

By the term "vector" as used herein, is meant any plasmid or virus
encoding an exogenous nucleic acid. The term should also be construed to include
non-plasmid and non-viral compounds which facilitate transfer of nucleic acid into
virions or cells, such as, for example, polylysine compounds and the like. The vector
may be a viral vector which is suitable as a delivery vehicle for delivery of the SMN,
SIP1, Gemin3, or all of the aforementioned, protein or a nucleic acid encoding SMN,
SIP1, Gemin3 to the patient, or the vector may be a non-viral vector which is suitable
for the same purpose. Examples of viral and non-viral vectors for delivery of DNA to
cells and tissues are well known in the art and are described, for example, in Ma et al.
(1997, Proc. Natl. Acad. Sci. U.S.A. 94:12744-12746). Examples of viral vectors
include, but are not limited to, a recombinant vaccinia virus, a recombinant adenovirus,
a recombinant retrovirus, a recombinant adeno-associated virus, a recombinant avian
pox virus, and the like (Cranage et al., 1986, EMBO J. 5:3057-3063; International
Patent Application No. WO94/17810, published August 18, 1994; International Patent
Application No. WO94/23744, published October 27, 1994). Examples of non-viral
vectors include, but are not limited to, liposomes, polyamine derivatives of DNA, and
the like.

A "knock-out targeting vector," as the term is used herein, means a
vector comprising two nucleic acid sequences each of which is complementary to a
nucleic acid region flanking a target sequence of interest which is to be deleted and/or
replaced by another nucleic acid sequence. The two nucleic acid sequences therefore
flank the target sequence which is to be removed by the process of homologous
recombination.

The invention is further described in detail by reference to the following
experimental examples. These examples are provided for purposes of illustration only,
and are not intended to be limiting unless otherwise specified. Thus, the invention
should in no way be construed as being limited to the following examples, but rather,
should be construed to encompass any and all variations which become evident as a
result of the teaching provided herein.

Example 1: Association of SMN with its associated protein SIP1 in a complex with
spliceosomal snRNP proteins
The experiments presented in this example may be summarized as
follows.

Previously, molecular characterization of the protein product of the
SMN gene that causes SMA, i.e., SMN, led to the discovery of a novel nuclear
structure, called gem, in which SMN is concentrated (Liu and Dreyfuss, 1996, EMBO
J. 15:3555-3565). Gems appear by size, number, and proximity to be related to coiled
bodies, another subnuclear structure of unknown function. Previous studies suggested
that gems, like coiled bodies, are involved in RNA metabolism, and, consistent with
this, SMN was found to interact with several RNA binding proteins and possibly also
with RNA directly (Liu and Dreyfuss, 1996, supra; Liu et al., 1996, Cold Spring
Harbor Symp. Quant. Biol. 61:689-697). However, the specific function of SMN
remained obscure. The experiments described herein demonstrate that in a yeast
two-hybrid screen using SMN as the bait, a novel protein, SIP1, has been identified. SIP1
forms a stable complex with SMN in vivo and in vitro, and it co-localizes with SMN in
gems and in the cytoplasm.
The data disclosed herein indicate that SMN and SIP1 function as a
complex in vivo. For instance, the interaction of SMN and SIP1 in vitro is resistant to
1 M NaCl, suggesting that the proteins interact avidly. The 300 kDa complex that
contains SMN and SIP1 is stable even in 4 M urea. Also, SMN and SIP1 can be
coimmunoprecipitated with specific monoclonal antibodies. Further, a yeast
two-hybrid screen using SIP1 as the bait under high stringency conditions (15 mM
3-aminotriazole) isolated from a human library only SMN clones. In addition, SMN and
SIP1 co-localize in gems, suggesting that these two proteins function together.

Immunopurification of the 300 kDa complex demonstrated that it
contains, besides SMN and SIP1, spliceosomal snRNP core proteins including B/B', D,
E, F, and G, the snRNP-specific protein U1A, and several other unidentified proteins.
Furthermore, the data disclosed herein demonstrate that SMN interacts directly with
several spliceosomal snRNP core Sm proteins, including B/B', D1-3, and E. These data
suggest that the SMN-SIP1 complex plays an important role in spliceosomal snRNP
biogenesis and/or function. Although BLAST searches of sequence databases with the
SIP1 protein did not identify significant homology to any other proteins, visual
inspection suggested a limited but significant homology between SIP1 and the yeast
protein Brr1 (Noble and Guthrie, 1996, Genetics 143:67-80; 1996, EMBO J.
15:4368-4379). The sequence alignment of human SIP1 and yeast Brr1 is shown in Figure 1.
Brr1 has been shown to be involved in snRNP biogenesis in Saccharomyces cerevisiae;
deletion of Brr1 causes destabilization of newly synthesized spliceosomal U2 snRNA,
and Brr1 interacts genetically with the yeast Sm D1 protein (Noble and Guthrie, 1996,
EMBO J. 15:4368-4379). Searching the yeast genome sequence for possible SMN
homologs has not identified any proteins with significant similarity to SMN.

SMN, because it can interact with SIP1 and with the spliceosomal
snRNP Sm proteins via different domains, can potentially serve as the key bridging
component to bring together the various components of the complex. It is therefore
particularly interesting to note that many SMA patients have deletions or point
mutations encompassing exons 6 and 7 of SMN, the region shown herein to be
involved in binding of SMN to the Sm proteins. Further, several missense mutations in
the region of SMN corresponding to the P2 peptide have recently been described
(Hahnen et al., 1996, Hum. Mol. Genet. 4:1927-1933), and it will be of interest to
determine if such mutations exhibit altered binding to the Sm proteins.

The Materials and Methods used in the experiments presented in this 
example are now described. 
Yeast Two-Hybrid Screening

The human HeLa cDNA library, yeast strains, and yeast plasmids
pGBT9, pGADGH, pVA3, and pTD1 were obtained from Clontech Laboratories, Inc.
(Palo Alto, CA). The manipulation of yeast and the library screening were carried out
according to the conditions suggested by the manufacturer. In brief, the coding region
of SMN was cloned into the pGBT9 vector. The S. cerevisiae HF7c reporter strain was
first transformed with the pGBT9-derived SMN construct and, subsequently, with the
HeLa cDNA library.

Approximately 6 x 10^6 transformants were seeded on eight 150 mm
plates containing synthetic medium lacking histidine, leucine, and tryptophan. His+
colonies were grown on synthetic medium plates lacking leucine and tryptophan and
were then assayed for β-galactosidase activity by filter assay as described by the
manufacturer. Of 6 million transformants screened, 146 were His+ LacZ+ colonies.
These positive colonies fell into ten groups. One of these groups had ten independent
cDNA clones that all encoded identical sequences of SIP1. The library plasmid was
recovered from these clones into the Escherichia coli HB101 strain. True positive
clones were confirmed by their ability to transactivate HIS3 and LacZ reporters when
cotransforming HF7c with pGBT9 containing SMN.
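
The figures reported for the screen can be put in proportion with a short calculation, sketched below in Python. The function name `screen_summary` and the dictionary keys are hypothetical; the inputs are the numbers stated above (6 million transformants, eight plates, 146 His+ LacZ+ colonies, ten SIP1 clones). The per-plate figure assumes the transformants were divided evenly among the plates, which the text does not state.

```python
def screen_summary(transformants: int, plates: int,
                   double_positives: int, clones_of_interest: int) -> dict:
    """Summarize a two-hybrid screen from its reported counts."""
    return {
        # Assumes an even split of transformants across plates.
        "transformants_per_plate": transformants / plates,
        # Double-positive colonies recovered per million transformants screened.
        "positives_per_million": 1e6 * double_positives / transformants,
        # Share of the double-positive colonies that were the clone of interest.
        "percent_of_positives": 100.0 * clones_of_interest / double_positives,
    }


summary = screen_summary(6_000_000, 8, 146, 10)
for name, value in summary.items():
    print(f"{name}: {value:.1f}")
```

On these numbers, roughly 24 colonies per million transformants were His+ LacZ+, and the ten SIP1 clones account for a little under 7% of those positives.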

Production of Monoclonal Antibodies to SIP1

Anti-SIP1 antibodies 2S7 and 2K9 were prepared by immunizing
Balb/C mice with His6-tag SMN chimeric protein purified from nickel chelation
chromatography using a Novagen (Madison, WI) His-Bind buffer kit. Hybridoma
production and screening and ascites fluid production were performed as previously
described (Choi and Dreyfuss, 1984, J. Cell Biol. 99:1997-2004).

Production of Proteins In Vitro 

The [35S]-labeled proteins were produced by an in vitro
transcription-translation reaction (Promega Biotech, Madison, WI) in the presence of
[35S]-methionine (Amersham, Arlington Heights, IL). His6-SMN fusion protein was
expressed from a pET bacterial expression system in the E. coli strain
BL21(DE3)pLysS and purified using nickel chelation chromatography using the same
kit as described above. GST-SMN fusion protein was expressed from a GST
expression vector pGEX-5X-3 (Pharmacia Biotech, Piscataway, NJ) in the E. coli
strain BL21 and purified using glutathione-Sepharose provided by Pharmacia Biotech
according to the manufacturer's protocol.

In Vitro Protein-Binding Assays

Purified GST or GST fusion protein (2 micrograms) was incubated with
cpm of the in vitro translated protein product and 25 µl of glutathione-Sepharose
beads in 500 µl of binding buffer (50 mM Tris-HCl [pH 7.5], 2 mM EDTA, 0.1%
NP40, 2 micrograms per milliliter leupeptin and pepstatin A, and 0.5% aprotinin)
containing different salt (NaCl) concentrations. Following incubation for 30 minutes at
4°C, the resin was pelleted, washed with binding buffer, and the bound fraction was
eluted by boiling in SDS-PAGE sample buffer. The proteins were analyzed by
SDS-PAGE, and they were visualized by fluorography. For the binding experiments
described in Figure 5, 200 to 300 ng of GST or GST-SMN fusion protein was bound to
30 µl of glutathione-Sepharose beads dissolved in phosphate buffered saline (pH 7.4)
and incubated with [35S]-labeled in vitro translated protein (approximately 1 x 10^ to 2
x 10^ cpm/assay) for 30 minutes at 4°C. The beads were subsequently washed six
times with phosphate buffered saline/350 mM NaCl (pH 7.4), and the bound protein
fraction was eluted by boiling in SDS-PAGE sample buffer.
Cell Culture and Treatments 



HeLa cells and NIH 3T3 cells were cultured in Dulbecco's modified
Eagle's medium (DMEM; GIBCO BRL, Gaithersburg, MD) supplemented with 10%
fetal calf serum (FCS, GIBCO BRL). Low temperature incubations were carried out as
follows. HeLa cells were shifted to 32°C and incubated for 24 hours prior to fixation
and permeabilization of the cells for immunostaining. For actinomycin D treatment,
HeLa cells were incubated with 5 micrograms per milliliter of actinomycin D for 3
hours before fixation for immunostaining. For in vivo labeling with [35S]methionine,
50% confluent HeLa cells growing in 100 mm plates were incubated with 10
microCuries per milliliter of [35S]methionine in DMEM without methionine and
supplemented with 10% FCS overnight before homogenization for
immunoprecipitation.

Immunoprecipitation and Immunoblotting 

Immunoprecipitation of in vitro translated SIP1 protein was carried out
in the presence of 1% Empigen BB buffer as previously described by Choi and
Dreyfuss (1984, J. Cell Biol. 99:1997-2004). Immunoprecipitation and purification of
the SMN complex was carried out using total HeLa cell lysate in the presence of 0.5%
Triton X-100 as previously described in Pinol-Roma et al. (1988, Genes Dev.
2:215-227). For immunoblotting, proteins were resolved on a 12.5% SDS-polyacrylamide
gel and transferred to nitrocellulose membrane (Schleicher and Schuell, Inc., Keene,
NH) using a BioTrans Model B Transblot apparatus (Gelman Sciences, Ann Arbor,
MI) according to the instructions of the manufacturer. Filters were incubated in
blotting solution (phosphate buffered saline, 5% nonfat milk) for at least 1 hour at
room temperature, rinsed with cold phosphate buffered saline, and then incubated with
primary antibody for at least 1 hour at room temperature. The filters were washed
three times in phosphate buffered saline containing 0.1% Tween 20, and bound
antibodies were detected using the peroxidase-conjugated goat anti-mouse IgG plus
IgM (Jackson ImmunoResearch Laboratories, Bar Harbor, ME). The protein bands
were visualized using an ECL Western blotting detection kit (Amersham, Arlington
Heights, IL) after washing the filters three times in phosphate buffered saline
containing 0.1% Tween 20.

Immunofluorescence Microscopy 

Immunofluorescence microscopy was carried out essentially as
previously described (Choi and Dreyfuss, 1984, J. Cell Biol. 99:1997-2004) with the
following modifications. Primary monoclonal antibodies 2B1 and 1816 were diluted
1:1000 in phosphate buffered saline containing 3% bovine serum albumin (BSA). The
incubation with the first and second antibody was at room temperature for 1 hour. In
double-label immunofluorescence experiments, a mixture of primary or secondary
antibodies was incubated at the same time. Laser confocal fluorescence microscopy
was performed with a Leica TCS 4D (Germany) confocal microscope. The antibodies
used for these experiments were as follows: antibody against p80-coilin: monoclonal
antibody Pd and rabbit polyserum R288; antibody against fibrillarin: human
autoimmune antibody 1881. Rabbit polyclonal antibody against SMN was raised
against exon 7 for SMN protein by Quality Controlled Biochemicals, Hopkinton, MA,
and the polyclonal antibodies were affinity purified.

HeLa Cell Fractionation and Chromatography

HeLa cells were fractionated according to the methods described by
Dignam et al. (1983, Nuc. Acids Res. 11:1475-1489). Fractionation of the nuclear or
cytoplasmic S100 fraction was carried out as follows. The nuclear fraction (200 µl of
approximately 20 milligrams per milliliter protein) in buffer D (20 mM Tris-HCl [pH
7.4], 0.1 mM EDTA, 1 mM DTT, 20% glycerol, 500 mM KCl) or S100 fraction (400
microliters of approximately 20 milligrams per milliliter protein) in buffer F (20 mM
Tris-HCl [pH 7.4], 0.1 mM EDTA, 1 mM DTT, 10% glycerol, 500 mM KCl) was
loaded onto a TSK-GEL G3000-SW glass column (TosoHaas, Montgomeryville, PA).
The column was then washed with buffer A (20 mM Tris-HCl [pH 7.4], 200 mM
NaCl, 2.5% glycerol) with or without 4 M urea at 0.25 ml/min flow rate. Fractions (0.5
ml) were collected, and 15 µl of each fraction was analyzed on an SDS-PAGE.
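
The quantities in the fractionation procedure above imply a few derived figures, which may be worked out as in the following Python sketch. The function names are hypothetical, and the protein concentration is taken at the stated approximate value of 20 milligrams per milliliter, so the loaded amounts are approximate as well.

```python
def protein_loaded_mg(volume_ul: float, concentration_mg_per_ml: float) -> float:
    """Protein loaded onto the column, in milligrams."""
    return volume_ul / 1000.0 * concentration_mg_per_ml


def minutes_per_fraction(fraction_ml: float, flow_ml_per_min: float) -> float:
    """Time needed to collect one fraction at a constant flow rate."""
    return fraction_ml / flow_ml_per_min


def percent_of_fraction_analyzed(aliquot_ul: float, fraction_ml: float) -> float:
    """Share of each fraction that is run on the gel."""
    return 100.0 * aliquot_ul / (fraction_ml * 1000.0)


print(protein_loaded_mg(200, 20))             # nuclear fraction, about 4 mg
print(protein_loaded_mg(400, 20))             # S100 fraction, about 8 mg
print(minutes_per_fraction(0.5, 0.25))        # 2 minutes per 0.5 ml fraction
print(percent_of_fraction_analyzed(15, 0.5))  # 3 percent of each fraction
```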

Far Western Analyses Using [35S]SMN

In vitro purified snRNP proteins were analyzed by SDS-PAGE, and the
proteins were then transferred onto a nitrocellulose membrane as described previously
herein. The nitrocellulose membrane was incubated in blotting solution (phosphate
buffered saline, 5% nonfat milk) for at least 1 hour at room temperature, rinsed with
cold phosphate buffered saline, and then incubated with in vitro translated
[35S]methionine-labeled SMN (2 x 10^ cpm) for 2 hours at room temperature. The
nitrocellulose membrane was washed three times in phosphate buffered saline
containing 0.05% NP40, and bound SMN was detected by exposing the membrane to
X-ray film.

Preparation of BSA-Peptides Coniugate.'; 

BSA-peptide conjugates were prepared as described in Fischer et al. 
(1995, Cell 82:475-483). In brief, peptides starting with a N-terminal cysteine and 
consisting of amino acids 13-44 (CRRGAGQSDDSDIWDDTALIKAYDKAVS) and 
240-267 (CEDDEALGSMLISWYMSGYHTGYYLGLKQ) of human SMN or the 
HIV-1 Rev NES (CLPPLERLTL) (Fischer et al., 1995, supra) were cross-linked to 
BSA using sulfo-SMCC (Pierce Chemical Co., Rockford, IL) as a cross-linker. The 
cross-linked BSA conjugates were separated from unreacted peptides and cross-linking
reagent by dialysis against phosphate buffered saline (pH 7.4) and the conjugates were 
concentrated using a Centricon30 concentrator (Amicon, Bedford, MA). 

The Results of the experiments presented in this example are now described.

SIP1, a Novel SMN-Interacting Protein

Using SMN as a bait in a yeast two-hybrid screen of a HeLa cDNA
library, ten independent partial cDNA clones with insert sizes ranging from about 1 kb
to about 1.3 kb were isolated, all of which contained the same open reading frame. The
longest of these clones, designated 7-10, contained an insert of approximately 1.3 kb
that was completely sequenced. Using the BLAST search program to search the
GenBank database, an EST (clone #Z64761) (Cross et al., 1994, Nature Genet.
6:236-244) that is identical to the 5' end of clone 7-10 and which extends further upstream
was identified. Conceptual translation of this cDNA revealed another potential
methionine 24 amino acids upstream of the first methionine of clone 7-10.
Immediately upstream of this methionine was a stop codon. It is not clear which
methionine is the actual initiation methionine for the full-length SIP1 cDNA. The
3'-untranslated region is very AU-rich and contains a putative polyadenylation site
AAUAAA. Thus, this is likely the full-length cDNA clone for the novel protein of the
invention that interacts with SMN and which has been termed SIP1. The predicted amino
acid sequence of SIP1, along with the sequence of the Xenopus laevis homolog that is
also disclosed herein, is presented in Figure 1. The nucleic acid sequence of SIP1
disclosed herein encodes a protein of approximately 279 amino acids (including the
potential 24 amino acids predicted by the EST sequence) with a calculated molecular
mass of 32 kDa and pI of 5.3.
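
As a rough consistency check only, the relationship between the stated length (approximately 279 amino acids) and the stated calculated mass (32 kDa) may be sketched in Python as follows. The figure of 110 Da per residue is a common rule-of-thumb assumption and is not taken from the present disclosure; the calculated mass recited above is derived from the actual sequence, so this estimate is expected to agree only approximately. All names are hypothetical.

```python
# Rule-of-thumb average residue mass; an assumption, not a value from the text.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimate_mass_kda(n_residues, avg_residue_da=AVERAGE_RESIDUE_MASS_DA):
    """Approximate protein mass (kDa) from the number of amino acid residues."""
    return n_residues * avg_residue_da / 1000.0


# Length including the 24 residues predicted by the EST sequence.
full_length = estimate_mass_kda(279)
# Length if translation begins at the second methionine instead.
from_second_met = estimate_mass_kda(279 - 24)

print(round(full_length, 1), round(from_second_met, 1))
```

The estimate for 279 residues falls slightly below the sequence-derived value of 32 kDa, which is the level of agreement to be expected from an average residue mass.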

To examine the interaction of SIP1 with SMN and to characterize SIP1
further, monoclonal antibodies to the SIP1 protein were generated by immunizing mice
with purified recombinant 6His-tag SIP1 (starting with the second methionine)
chimeric polypeptide. Two of these monoclonals, 2E17 and 2S7, were further
characterized in detail and were shown to react with SIP1 specifically by both
immunoprecipitation and by Western blotting. Monoclonal antibody 2E17 reacted
with a protein of similar size in Xenopus; thus, using this as the primary antibody, a
Xenopus oocyte cDNA library was screened and the Xenopus homolog of SIP1 was
isolated. The predicted amino acid sequence of Xenopus SIP1 is presented in Figure 1.
Interestingly, all of the eight clones obtained by screening the Xenopus library with
2E17 monoclonal antibody lack the first 24 amino acids that are potentially found in
the human EST clone but are also missing from the clones obtained by screening the
human library with this antibody. Xenopus SIP1 is highly similar to human SIP1, the
two proteins being approximately 90% homologous in amino acid sequence (see
Figure 1). BLAST searches did not reveal significant homology to any other protein in
the databases. However, a yeast protein, termed Brr1, appears to have significant
similarity to SIP1 by visual sequence comparison (Figure 1).

SIP1 Interacts with SMN In Vitro and In Vivo
The yeast two-hybrid results were confirmed by assessing the
interaction of SIP1 with SMN both in vitro and in vivo in HeLa cells. For the in vitro
binding assay, SMN was expressed as a chimeric fusion protein with a glutathione
S-transferase (GST) tag polypeptide. SIP1 was produced and labeled with
³⁵[S]methionine by in vitro transcription and translation in rabbit reticulocyte lysate.
GST or GST-SMN proteins immobilized on glutathione-Sepharose were incubated
with ³⁵[S]-labeled SIP1 protein. Following washing at various salt concentrations (200
mM to 1 M), proteins that were still bound to either GST or GST-SMN immobilized on
glutathione-Sepharose were dissociated by boiling in SDS-containing sample buffer,
and the eluted material was analyzed by SDS-PAGE. As shown in Figure 2A,
full-length SIP1 bound specifically to immobilized GST-SMN but not to immobilized GST
alone. This binding appeared to be very avid because it was not disrupted at 1 M NaCl.

Immunoprecipitation experiments were performed to examine if SMN
and SIP1 interact in vivo. Anti-SMN monoclonal antibody 2B1 (Liu and Dreyfuss,
1996, EMBO J. 15:3555-3565) was used to immunoprecipitate SMN from total HeLa
cell extract. The immunoprecipitates were then resolved by SDS-PAGE and
immunoblotted with monoclonal antibody 2S7 specific for SIP1. As shown in Figure
2B (lane designated "2B1 IP"), 2S7 readily detected SIP1 in the 2B1
immunoprecipitates, indicating that SIP1 was coimmunoprecipitated with SMN. In a
reciprocal experiment, the SMN protein was also coimmunoprecipitated by the
anti-SIP1 monoclonal antibody 2S7 (Figure 2B, lane designated "2S7 IP"). These results
were confirmed using other monoclonal antibodies to SMN and to SIP1. As shown in
Figure 2, there was no crossreactivity between the anti-SMN and anti-SIP1 monoclonal
antibodies. These results indicate that SMN and SIP1 are associated in vivo and that
they can be coimmunoprecipitated by either anti-SMN or anti-SIP1 monoclonal
antibodies.

Gel filtration experiments further confirmed the existence in vivo of a
complex containing both SIP1 and SMN. HeLa nuclear and cytoplasmic S100 extracts
were fractionated on a high performance gel filtration column, TSK-GEL G3000-SW,
and each fraction was subjected to SDS-PAGE. SMN and SIP1 were detected in the
column fractions by immunoblotting with specific monoclonal antibodies. Figure 2C
depicts the results of the cytoplasmic fractionation experiments. SMN and SIP1
comigrate as a peak of approximately 300 kDa, suggesting that they are part of a large
macromolecular complex. The observed size suggests that this complex contains
multiple copies of the SMN and SIP1 proteins and/or additional components. This
complex is very stable, as it resists dissociation by 4 M urea, and it is observed in both
the nuclear and cytoplasmic fractions.

SIP1 and SMN Co-localize in Gems in the Nucleus and in the Cytoplasm

Indirect immunofluorescence microscopy using the anti-SIP1
monoclonal antibodies 2E17 and 2S7 was performed on HeLa cells to determine the
cellular localization of SIP1. Figures 3A and 3B illustrate that SIP1 is found
throughout the cytoplasm and also displays intense staining of prominent discrete
bodies in the nucleus as assessed using standard light microscopy immunofluorescence.
This pattern is very similar to that seen for the SMN protein (Liu and Dreyfuss, 1996,
EMBO J. 15:3555-3565), except that the nucleoplasmic staining of SIP1 is somewhat
stronger than that seen for SMN.

In order to determine if the intensely staining nuclear structures are
gems or coiled bodies, double-label laser confocal immunofluorescence experiments
were performed using antibodies against p80-coilin (Figure 3C), fibrillarin (Figure
3D), snRNP proteins (Y12, Figure 3E) found in coiled bodies, and SMN (Figure 3F)
found in gems. Figures 3C, 3D, and 3E depict the results of the double labeling
experiments using the anti-coiled body antibodies and the anti-SIP1 2S7. The nuclear
structures that contain SIP1 were clearly different from coiled bodies, but the two
bodies were, in most cases, closely associated. However, the staining with the
anti-SIP1 monoclonal antibody 2S7 and a rabbit serum raised against exon 7 of SMN
demonstrates that SMN and SIP1 completely colocalized in gems (Figure 3F). The
weak signal in the cytoplasm makes it impossible to determine whether SMN and SIP1
also completely colocalized in the cytoplasm. However, co-localization of SMN and
SIP1 is very likely because of the tight association of SMN with SIP1 described herein.
The co-localization of SMN with SIP1 further supports the conclusion that these two
proteins exist as a complex in the cell. SIP1 is thus the second constituent of gems
described thus far.

The SMN-SIP1 Complex Contains Spliceosomal snRNP Proteins



The observation that SMN and SIP1 are associated in a large
(approximately 300 kDa) complex prompted experiments to identify any possible
additional components in this complex. In order to identify additional components of
the SMN-SIP1 complex, immunoprecipitations using anti-SMN and anti-SIP1
monoclonal antibodies on ³⁵[S]-labeled HeLa cell lysates were performed, and the
immunoprecipitated proteins were then analyzed by SDS-PAGE. As shown in Figure
4A, similar patterns were obtained using anti-SMN and anti-SIP1 monoclonal
antibodies to immunoprecipitate complex-associated proteins. Several proteins were
specifically coimmunopurified by anti-SMN and anti-SIP1 antibodies. Besides SMN
and SIP1 proteins, there was a prominent doublet at approximately 97 kDa, a group
of proteins of approximately 28 kDa, and a group of proteins of approximately 15
kDa. This protein complex was quite stable, since it was resistant to SDS/Triton
X-100/deoxycholate-containing buffer and to a high salt wash (500 mM NaCl). As a
reference for these immunoprecipitations (and for reasons discussed herein), a lane was
included in the gel demonstrating an immunoprecipitation with the monoclonal
antibody Y12 (Figure 4A, lane designated "Y12") which antibody is directed against
the Sm proteins common to spliceosomal snRNPs (Lerner and Steitz, 1979, Proc. Natl.
Acad. Sci. USA 76:5495-5499; Lerner et al., 1981, Proc. Natl. Acad. Sci. USA
78:2737-2741). To characterize this complex further, the protein samples were
subjected to high resolution, two-dimensional nonequilibrium pH gradient gel
electrophoresis (NEPHGE). Figure 4B depicts the major proteins that were
specifically found in the anti-SMN (2B1) isolated complex but not in control (SP2/0)
immunoprecipitations. The major proteins selectively immunoprecipitated by
anti-SMN (2B1) are labeled as SIP1, SIP2, SIP3, and SIP4. Further, the group of basic, low
molecular weight proteins in the anti-SMN immunoprecipitate exhibited the same
pattern as some of the Sm proteins in immunopurified snRNPs. For direct comparison,
anti-Sm (Y12) immunoprecipitate from HeLa nuclear fractions was analyzed in
parallel, and U1A, Sm B/B', D1-3, E, F, and G proteins of snRNPs migrated in exactly
the same pattern as those proteins in the SMN-SIP1 complex. Immunoblotting
experiments using monoclonal antibodies against the U1 snRNP-specific protein U1A
and the anti-Sm monoclonal antibody Y12 confirmed that these proteins were indeed
the spliceosomal snRNP proteins. Immunoprecipitations using Y12 starting with either
total HeLa extract or that derived from nucleoplasm also demonstrated that SMN and
SIP1 could be detected in Y12 immunoprecipitates by immunoblotting. The
immunoprecipitations shown in Figure 4 were carried out using nucleoplasm as the
starting material. Similar results, although with considerably higher background, were
obtained using cytoplasmic or whole cell extracts, and predigestion with RNases did
not reduce the signal. This suggests the SMN-SIP1-Sm protein complexes can be
found both in the nucleus and in the cytoplasm. It has not been definitively determined
whether the immunoprecipitated SMN-SIP1-Sm complexes contain snRNAs since,
e.g., the RNase resistance of the complexes may simply be due to the fact that the
RNAs were not accessible to RNase. However, experiments described herein
demonstrate that SMN and SIP1 immunoprecipitated labeled snRNAs, suggesting the
complex may contain such ribonucleic acids.

The SMN Protein Interacts with Sm B/B', Sm D, and Sm E Proteins Directly

Most of the snRNAs in snRNP complexes are resistant to RNase
digestion and this, therefore, makes it difficult to determine if the SMN-SIP1-snRNP
protein complexes result from protein-protein or protein-RNA interactions. To
investigate whether SMN interacts with snRNP proteins directly, in vitro translated
³⁵[S]methionine-labeled SMN was used in a far-Western blot assay to probe
SDS-PAGE-resolved proteins of purified snRNPs. The protein composition of purified
snRNPs is shown in Figure 5 (lane designated "snRNPs"). The result of probing these
proteins with ³⁵[S]SMN (Figure 5, lane designated "³⁵[S]SMN") indicates that SMN
specifically and directly bound to Sm B/B' and also to one or several of the Sm D
proteins. When the same experiments were performed with in vitro translated SIP1, no
specific binding of SIP1 to snRNP proteins was detected although, in the same assay,
SIP1 bound strongly to recombinant SMN protein. These findings suggest that the
association of SMN with snRNPs occurs via direct protein-protein interaction between
SMN and Sm B/B' and one or more of the D group proteins, although the possibility
that some component in the rabbit reticulocyte lysate mediates this interaction cannot
be excluded.

The binding in solution of SMN to other Sm proteins was examined
using in vitro translated and ³⁵[S]-labeled Sm proteins B, D1, D2, D3, E, F, and G
which were tested for their ability to bind to recombinant GST-SMN fusion protein
immobilized on glutathione-Sepharose beads pursuant to the methods described by
Lehmeier et al. (1994, Proc. Natl. Acad. Sci. USA 91:12317-12321), Herrmann et al.
(1995, EMBO J. 14:2076-2088) and Raker et al. (1996, EMBO J. 15:2256-2269). As
shown in Figure 5B, all Sm proteins, except for F and G, bound efficiently to
immobilized GST-SMN, whereas there was no detectable binding to immobilized GST
alone. Moreover, similar experiments failed to detect any interaction between the Sm
proteins and SIP1.

SMN Contains Two Distinct Binding Sites for the Sm Proteins and for SIP1

The data disclosed previously herein demonstrate that SMN interacts
with both SIP1 and with several of the Sm proteins; therefore, it was determined
whether binding of SMN to Sm proteins and to SIP1 was mutually exclusive or
whether SMN could bind both Sm proteins and SIP1 at the same time, possibly via two
different binding sites on a single SMN protein. Truncated peptides were synthesized
corresponding to the two most conserved regions of SMN (determined by comparing
the sequence of the human SMN with that of Xenopus SMN) located at the N terminus
(at amino acids 13-44) and at the C terminus (at amino acids 240-267), under the
assumption that these highly conserved domains may be involved in important
protein-protein interactions. These regions are also conserved in several candidate SMN
orthologs identified in divergent organisms (Talbot et al., 1997, Hum. Mol. Genet.
6:497-500). These truncated peptides were then coupled to BSA (termed BSA-P1 and
BSA-P2, respectively) and used as competitors in the binding assays for SMN to SIP1
and for SMN to the Sm proteins. Without competitors, SIP1 and Sm B bound to
GST-SMN (Figure 5C) as noted previously herein. However, BSA-P1 completely abolished
binding of SIP1 to SMN, while the binding of Sm B was unaffected (Figure 5C). In
contrast, BSA-P2 strongly inhibited the binding of SMN to Sm B but had no effect on
the binding of SMN to SIP1. BSA coupled to HIV-1 Rev NES protein (BSA-Ctrl)
(Fischer et al., 1995) was used as a non-specific control and this chimeric protein had
no effect on the binding of SMN to either SIP1 or to Sm B protein (Figure 5C).
Additional experiments demonstrated that the corresponding domains of SMN are
alone sufficient for binding to SIP1 and Sm B, and similar results were obtained for the
other Sm proteins. Thus, the data disclosed herein define two independent binding
sites for SIP1 and the Sm proteins on a single SMN protein molecule. SMN may
therefore serve as a critical bridge between the Sm proteins and SIP1, and SMN may
nucleate the formation of the SMN-SIP1-Sm complex.

Example 2: The SMN-SIP1 complex has an essential role in spliceosomal snRNP
biogenesis

The experiments presented in this example may be summarized as follows.

The data disclosed herein and obtained from microinjection studies in
Xenopus laevis oocytes provide important insight into the function of the SMA
disease gene product, SMN, and its associated protein SIP1. SMN and SIP1 are tightly
associated as two subunits of a heteromeric protein complex, and both are found in the
oocyte cytoplasm. Further, as previously disclosed herein, SMN and SIP1 are
associated in a complex of approximately 300 kDa which also contains the Sm
proteins, and several additional spliceosomal snRNP-specific proteins. The present
experiments demonstrate that the SMN-SIP1 complex has an essential role in
spliceosomal snRNP biogenesis. Several lines of evidence lead to this conclusion.
First, SMN and SIP1 are specifically associated in the cytoplasm with U1 and U5
snRNAs but not with nuclear snRNPs and not with other RNAs tested so far. Thus,
they are not components of mature nuclear snRNPs but rather are associated with them
only during the cytoplasmic phase of their biogenesis. Second, anti-SIP1 antibodies
strongly interfere with the assembly of the Sm core domain of spliceosomal U snRNAs
and with their nuclear import. Third, anti-SMN antibodies, surprisingly, have the
opposite effect and stimulate the assembly of Sm proteins onto the Sm site of
spliceosomal U snRNAs. In addition, as previously set forth herein, two distinct
domains in SMN, P1 and P2, have been identified that mediate its interaction with
SIP1 and with several Sm proteins, respectively, in addition to its capacity to interact
with itself (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565).

The Materials and Methods used in the experiments presented in this 
example are now described. 

Western Blot Analysis

Oocytes were defolliculated and dissected into nuclear and cytoplasmic
fractions (see methods set forth herein). The nuclei were precipitated in ethanol. The
cytoplasmic fractions were homogenized in 5:1 buffer (pursuant to Fischer et al., 1993,
EMBO J. 12:573-583) and centrifuged for 15 minutes at 4°C. The supernatant was
transferred to a new test tube and precipitated with four volumes of acetone. After
centrifugation, the pellets were washed with 70% ethanol, dried, and
resuspended in SDS-PAGE sample buffer. For Western blot analysis, proteins
were separated on an SDS-polyacrylamide gel (12.5%) and subsequently transferred to
a nitrocellulose membrane (Schleicher and Schuell, Inc., Keene, NH) using a
Bio-Trans Model B Transblot apparatus (Gelman Science) according to the manufacturer's
instructions.

After protein transfer, the blotting membrane was incubated in blotting
solution (phosphate buffered saline containing 5% nonfat
milk) for 1 hour at room temperature, rinsed with phosphate buffered saline, and then
incubated in blotting solution with the primary antibody for 1 hour at room
temperature. The membrane was then washed three times with phosphate buffered
saline containing 0.1% Tween 20, and bound antibodies were detected using
peroxidase-conjugated goat anti-mouse IgG plus IgM (Jackson ImmunoResearch
Laboratories, Bar Harbor, ME). The proteins were visualized using an ECL Western
blotting detection kit (Amersham, Arlington Heights, IL) after washing the membrane
three times in phosphate buffered saline containing 0.1% Tween 20.

Oocyte Injections



Oocyte injections were carried out as described in Fischer et al. (1993,
supra). In brief, oocytes were incubated for 3 hours in modified Barth's solution
containing 0.2% collagenase type II (Sigma Chemical Co., St. Louis, MO).
Defolliculated stage V and VI oocytes were collected and usually used on the same day
for microinjection.

In a typical injection experiment, 30 nl of ³²[P]-labeled RNA (1 x 10^
cpm/µl; total concentration of 0.7 µM) was injected either into the nucleus or into the
cytoplasm. For the antibody inhibition experiments, oocytes were preinjected with
antibody (1 microgram/µl or 3 micrograms/µl in Figure 9B) and the oocytes were
incubated for 1 hour before they received a second injection of ³²[P]-labeled RNA.
Nucleo-cytoplasmic transport of injected RNAs was monitored by dissection of the
oocytes into nuclear and cytoplasmic fractions. Both fractions were incubated for 20
minutes in homogenization buffer, and the RNAs were isolated and analyzed by
electrophoresis on denaturing RNA gels as described in Hamm et al. (1990, Cell
62:569-577).
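
By way of illustration only, the amount of RNA delivered per oocyte follows directly from the injection volume and concentration recited above, and may be sketched in Python as follows. The function name is hypothetical; the inputs (30 nl and 0.7 µM) are those stated in the text.

```python
def injected_fmol(volume_nl, conc_um):
    """Amount of solute (fmol) in a volume given in nl at a concentration in µM.

    1 nl x 1 µM = 1e-9 l x 1e-6 mol/l = 1e-15 mol = 1 fmol,
    so the numerical product is already expressed in femtomoles.
    """
    return volume_nl * conc_um


# Values from the text: 30 nl of RNA at a total concentration of 0.7 µM.
rna_fmol = injected_fmol(30, 0.7)
print(f"{rna_fmol:.1f} fmol RNA per injection")
```

Under these stated values, each injection delivers on the order of 21 fmol of RNA per oocyte.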

Immunoprecipitation of RNA-Protein Complexes 
Immunoprecipitation of RNA-protein complexes was performed
(pursuant to Fischer et al., 1993, supra). Briefly, the injected oocytes were
homogenized in 300 µl of ice-cold phosphate buffered saline (pH 7.4). The insoluble
fraction was pelleted by centrifugation, and the clear supernatant was transferred into a
new 1.5 ml Eppendorf tube containing antibodies bound to protein G-Sepharose beads
(Pharmacia). This mixture was incubated with constant shaking for 1 hour at 4°C and
the beads were subsequently washed five times with 1 ml aliquots of ice-cold
phosphate buffered saline. Bound RNAs were isolated by phenol extraction for 1 hour,
and the RNAs were then precipitated with ethanol, and analyzed by denaturing gel
electrophoresis as described previously herein.

Plasmid DNA In Vitro Transcription and Translation 
Plasmids coding for U1, U2, U4, U5, and U6 snRNAs have been
described previously in Hamm et al. (1990, Cell 62:569-577) and Fischer et al. (1993,
EMBO J. 12:573-583; 1995, Cell 82:475-483). Plasmids encoding dihydrofolate
reductase mRNA (DHFR) and tRNAl were described in Jarmolowski and Mattaj
(1993, EMBO J. 12:223-232), Jarmolowski et al. (1994, J. Cell Biol. 124:627-635),
and in Fischer et al. (1995, Cell 82:475-483). A plasmid coding for the human U3
snoRNA. Clones encoding the Sm proteins B, D1, D2, D3, E, F, and G are described
in Raker et al. (1996, EMBO J. 15:2256-2269). The clone encoding SIP1 has been
described previously herein. In vitro transcription of ³²[P]-labeled RNAs was carried
out exactly as described in Fischer et al. (1993, EMBO J. 12:573-583). Labeled RNA
was precipitated in ethanol and resuspended in water. For nuclear injection, RNA was
dissolved in water containing 10 milligrams per milliliter dextran blue (MW
1,000,000). In vitro translation of ³⁵[S]-labeled proteins was carried out using a
combined transcription and translation kit (TnT) (Promega Biotech, Madison, WI)
according to the instructions of the manufacturer.

The Results of the experiments presented in this example are now described.

As previously disclosed herein, a novel protein complex containing
SMN, SIP1, and spliceosomal snRNP proteins, including several of the Sm proteins,
has been discovered in the cytoplasm of human cells. The data presented herein were
obtained in the Xenopus oocyte system which is an advantageous, well-characterized
system in which to study spliceosomal snRNP biogenesis by use of microinjections
(Mattaj and DeRobertis, 1985, Cell 40:111-118; Mattaj, 1986, Cell 46:905-911).

SMN and SIP1 Are Associated with Spliceosomal snRNAs in the Cytoplasm

As previously disclosed herein, a novel protein complex containing
SMN, SIP1, and spliceosomal snRNP proteins, including several of the Sm proteins, is
present in the cytoplasm of human cells. Because the Xenopus oocyte provides a
particularly advantageous and well-characterized system in which to study
spliceosomal snRNP biogenesis by use of microinjections (Mattaj and DeRobertis,
1985; Mattaj, 1986), it was first determined whether SMN and SIP1 are present in
these oocytes. If so, the unique features of this system could then be used to
investigate the possible functions of these proteins in snRNP biogenesis.






Immunoblotting with monoclonal antibodies to the human SMN and
SIP1 proteins (2B1 and 2E17, respectively) on Xenopus tissue culture cells detected
proteins of similar size to the corresponding human proteins. cDNA cloning,
sequencing, and transfection experiments confirmed that these proteins are the Xenopus
homologs of SMN and SIP1. Surprisingly, however, unlike the situation in somatic
cells where there is more SMN and SIP1 in the nucleus, both proteins were found
almost exclusively in the oocyte cytoplasm (Figure 6A). The high cytoplasmic
concentration of SMN and SIP1 in the Xenopus oocyte is reminiscent of the large
amounts of Sm proteins that are stored in the oocyte cytoplasm (Zeller et al., 1983, Cell
32:425-434).

In order to identify potential cellular targets for the SMN-SIP1 complex,
the association of the complex with RNA was investigated. Various ³²[P]-labeled
RNAs were generated by transcription in vitro, including mRNA, tRNA, U3 snoRNA,
5S RNA, and the spliceosomal snRNAs U1, U2, U4, U5, and U6. Different mixtures
of these RNAs were then coinjected into the cytoplasm of oocytes, and
immunoprecipitations were carried out with anti-SMN (2B1) and anti-SIP1 (2E17)
monoclonal antibodies. As shown in Figure 6B, only U1 and U5 snRNAs were
efficiently immunoprecipitated, indicating that they interact with SMN and SIP1. A
weak but reproducible immunoprecipitation of U4 snRNA was also observed. In
contrast, other RNAs, including mRNA, tRNA, U3 snoRNA, U6 snRNA, U2 snRNA
(Figure 6B), and 5S rRNA were not immunoprecipitated at significant levels with
either the anti-SMN or the anti-SIP1 antibodies.

Since neither SMN nor SIP1 is detectable in the oocyte nucleus (Figure
6A), it seemed possible that they are not associated with mature U1 and U5 snRNPs
but rather associate with them only during the cytoplasmic phase of their biogenesis
pathway. To ascertain
this, U1 and U5 snRNAs were injected into the cytoplasm of oocytes, and the oocytes
were incubated for 12 hours (Figure 7). After this incubation period, approximately
50% of the injected snRNA was transported to the nucleus while the rest was still in
the cytoplasm. Immunoprecipitations from the nuclear and cytoplasmic fractions were
then carried out using either anti-SMN antibody, anti-SIP1 antibody, or the anti-Sm
monoclonal antibody Y12, and the coimmunoprecipitated RNAs were analyzed by
denaturing gel electrophoresis followed by fluorography. As previously discussed
herein, anti-Sm monoclonal antibody Y12 recognizes a subset of the Sm proteins and
was used in this experiment to monitor the assembly of the Sm core domain (Lerner et
al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741). As previously reported (Mattaj,
1986, Cell 46:905-911; Fischer and Luhrmann, 1990, Science 249:786-790), U1 and
U5 snRNAs were immunoprecipitated by Y12 in approximately equal amounts from
the nucleus and cytoplasm (Figure 7). This indicated that the Sm proteins associate in
the cytoplasm with the snRNA and then move as an assembled and stable snRNP
complex to the nucleus. In striking contrast, however, SMN and SIP1 association with
U1 and U5 snRNAs was observed only in the cytoplasm (Figure 7).

Further evidence for the physiological relevance of the interaction of
SMN and SIP1 in the cytoplasm with spliceosomal snRNPs was obtained following
nuclear injections of U1 snRNA. Only after export to the cytoplasm and during the
cytoplasmic phase of its biogenesis could U1 snRNA be immunoprecipitated by
anti-SMN or anti-SIP1 antibodies. Thus, the SMN-SIP1 complex interacts with U1 and U5
snRNAs in the cytoplasm but not after these snRNAs have been assembled into
snRNPs and have been imported into the nucleus. Therefore, SMN and SIP1 dissociate
from the spliceosomal snRNPs either prior to nuclear entry or shortly thereafter.

Anti-SIP1 Antibodies Inhibit Spliceosomal snRNP Assembly and Nuclear Import

The association of SMN and SIP1 with U1 and U5 snRNPs in the
cytoplasm suggested a role for these proteins in the biogenesis pathway of these
snRNPs, i.e., in the assembly of snRNP proteins onto these snRNAs and/or in the
nuclear import of these particles. Therefore, whether anti-SMN or anti-SIP1 antibodies
have an effect on the nucleo-cytoplasmic transport of spliceosomal snRNPs was
examined (Figure 8). Anti-SMN (2B1), anti-SIP1 (2E17), or anti-Sm (Y12) antibodies
were injected into the cytoplasm of oocytes. One hour later, a mixture of U1, U5, and,
as a control for nuclear injection, U6 snRNA was injected into the nuclei of the same
oocytes, and the incubation was continued for 15 hours. The oocytes were then
fractionated, and the RNAs in the nucleus and cytoplasm were analyzed. In control
oocytes preinjected with water, nuclear-injected U1 and U5 snRNAs were exported to
the cytoplasm and, after cytoplasmic assembly of the Sm core domain, reimported to
the nucleus (Figure 8A) (Hamm et al., 1990, Cell 62:569-577; Neuman de Vegvar and
Dahlberg, 1990, Mol. Cell. Biol. 10:3365-3375; Terns et al., 1993, Genes Dev.
7:1898-1906). This was confirmed by immunoprecipitation of U1 and U5 snRNAs with
anti-Sm antibodies and by the observation that the nuclear pool of U1 snRNA had
undergone 3' end trimming (see Figure 8A, nuclear fractions). The latter has been
previously shown to occur in the cytoplasm prior to nuclear import (Neuman de
Vegvar and Dahlberg, 1990, supra; Terns et al., 1993, supra). In oocytes preinjected
with anti-Sm antibody Y12, U1 and U5 were also exported to the cytoplasm; however,
they were not reimported to the nucleus, resulting in the accumulation of these RNAs in
the cytoplasm (Figure 8A). This is because upon binding to the Sm proteins, Y12
interferes with the subsequent steps in the biogenesis of snRNPs that are required for
their nuclear import as discussed below. Surprisingly, a similar result was obtained in
oocytes preinjected with anti-SIP1 antibodies (Figure 8A). However, no effect on
nuclear import of U1 and U5 snRNA was observed in oocytes injected with anti-SMN
antibodies (Figure 8A).

Next, it was determined whether anti-SIP1 antibodies interfere with the
nuclear import of the other spliceosomal snRNPs. In vitro transcribed snRNAs U1,
U2, U4, and U5 were injected into the cytoplasm of oocytes, either without or with
anti-SIP1 antibody, and nuclear import was then assessed 15 hours later (Figure 8B).
In the absence of anti-SIP1 antibody injection, all snRNAs accumulated in the nucleus
to approximately 50%, although the import of U4 was less efficient (Figure 8B).
However, in the presence of anti-SIP1 antibody, the nuclear import of U1, U2, and U5
was almost completely inhibited, and the import of U4 was delayed by at least 50%
(Figure 8B). Thus, anti-SIP1 antibodies interfere with the nuclear import of all
spliceosomal U snRNPs tested regardless of whether they can be efficiently
immunoprecipitated with the anti-SMN or anti-SIP1 antibodies. This suggests that the
interaction of SMN and SIP1 with some snRNAs is transient and cannot be monitored
by immunoprecipitation.
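
By way of illustration only, the percentages of nuclear accumulation and inhibition discussed above may be expressed as simple ratios of the RNA signal recovered in each compartment. The following Python sketch uses hypothetical band intensities in arbitrary units, which are not data from the experiments described herein, and the function names are likewise hypothetical.

```python
def nuclear_fraction(nuclear_signal, cytoplasmic_signal):
    """Fraction of the total recovered RNA signal found in the nucleus."""
    total = nuclear_signal + cytoplasmic_signal
    if total <= 0:
        raise ValueError("total signal must be positive")
    return nuclear_signal / total


def percent_inhibition(control_fraction, treated_fraction):
    """Reduction in nuclear accumulation relative to the control, in percent."""
    return 100.0 * (1.0 - treated_fraction / control_fraction)


# Hypothetical band intensities (arbitrary units), for illustration only.
control = nuclear_fraction(500.0, 500.0)   # half of the RNA in the nucleus
treated = nuclear_fraction(50.0, 950.0)    # little RNA in the nucleus
inhibition = percent_inhibition(control, treated)

print(f"control {control:.0%}, treated {treated:.0%}, inhibition {inhibition:.0f}%")
```

In this hypothetical case, a control accumulation of one half and a treated accumulation of one twentieth correspond to an inhibition of about 90 percent.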

As discussed previously herein, assembly of the Sm core domain and
the formation of the m3G cap are required for the nuclear import of U snRNPs (Mattaj
and DeRobertis, 1985, Cell 40:111-118; Fischer and Luhrmann, 1990, Science
249:786-790; Hamm et al., 1990, Cell 62:569-577; Fischer et al., 1993, EMBO J.
12:573-583). Therefore, anti-SIP1 antibodies may inhibit U snRNP import by
interfering either with the assembly of the Sm core or with the cap hypermethylation or
both. Alternatively, and not mutually exclusive, the antibody could directly interfere
with the snRNP transport process, e.g., by blocking transport factors.

To further examine the inhibitory effect of anti-SIP1 antibodies on U
snRNP nuclear import, the effect of anti-SIP1 antibodies on the assembly of the Sm
core domain was studied (Figure 9A). Anti-SIP1, anti-SMN, or anti-Sm (Y12) antibodies,
and, as negative controls, nonimmune antibodies SP2/0 or water, were injected into the
cytoplasm of oocytes. The oocytes were incubated for 1 hour and then received a second
cytoplasmic injection of a mixture of [32P]-labeled U1, U5, and U6 snRNAs. One hour
later, the Sm core assembly on the injected RNAs was analyzed by
immunoprecipitation with Y12 (Figure 9A). In control oocytes preinjected with either
water or SP2/0, both U1 and U5 snRNAs were efficiently immunoprecipitated with
Y12, indicating that these RNAs assembled with the Sm proteins. As expected, U6
snRNA was not immunoprecipitated because this RNA does not contain an Sm site and
thus cannot bind Sm proteins. However, injection with anti-SIP1 antibody completely
inhibited the Sm core assembly on U1 and U5 snRNAs (Figure 9A). Y12 itself
strongly reduced the Sm core assembly on both U1 and U5 snRNAs to a similar extent
(Figure 9A). The anti-SMN antibody 2B1, in contrast, did not inhibit but rather
slightly stimulated assembly of Sm proteins (Figure 9A). If the snRNAs are injected 1
hour prior to injection of the anti-SIP1 antibody, no interference with subsequent Y12
immunoprecipitation is observed. The observed inhibition of Sm core assembly by
pre-injection of anti-SIP1 is therefore not due to a nonspecific occlusion of the Sm
epitope by the anti-SIP1 antibody (Figure 9A).




The effects of injection of anti-SIP1 antibodies on the assembly of U2
and U4 snRNAs were also determined. As shown in Figure 9A, pre-injection of
anti-SIP1 antibodies strongly interfered with the Sm core assembly on both U2 and U4
snRNAs, while in oocytes preinjected with water, both snRNAs assembled with the Sm
proteins. The inhibition of nuclear import of snRNAs U1, U2, U4, and U5 by
anti-SIP1 antibodies is, therefore, at least partially due to the interference of the anti-SIP1
antibody with the Sm core domain assembly. Thus, without wishing to be bound by
theory, SIP1 is a cytoplasmic assembly factor that mediates the formation of the Sm
core domain on spliceosomal U snRNPs.

Anti-SMN Monoclonal Antibody Stimulates Sm Protein Binding onto the Sm Site of U snRNAs

Although SMN is in a tight complex with SIP1 and, therefore, is likely
to form a functional unit with SIP1, the anti-SMN antibodies used in the experiments
described above, unlike the anti-SIP1 antibodies, did not interfere with U snRNP
biogenesis; rather, anti-SMN antibodies stimulated core domain assembly. Therefore,
the function of SMN in snRNP assembly was examined further. As shown in Figure
9B, injection of anti-SMN antibodies (2B1) did not inhibit snRNP assembly. However,
upon injection of high concentrations (2 to 3 micrograms/µl) of this anti-SMN
antibody, the assembly of snRNPs was often enhanced. In order to analyze this effect
in more detail, a mixture of U5 and U6 snRNAs was injected along with either high (3
micrograms/µl) or low (1 microgram/µl) concentrations of the anti-SMN antibody
2B1 or the anti-SIP1 antibody 2E17 (Figure 9B). Sm protein binding onto these RNAs
was then assessed by immunoprecipitation with the anti-Sm antibody Y12 one hour
later. After 1 hour, the assembly of Sm proteins onto U snRNAs was not complete,
thus allowing a more quantitative evaluation of the efficiency of Sm protein binding.
In the absence of coinjected antibody, U5 but not U6 was precipitated by anti-Sm,
indicating Sm core formation on U5 snRNA but not on U6 at that time point.
While low concentrations of 2B1 had only a slight stimulatory effect on Sm
protein binding, coinjection of higher 2B1 concentrations enhanced the
assembly approximately 2- to 3-fold (Figure 9B). Coinjected anti-SIP1 antibody, in
contrast, inhibited the assembly of Sm proteins onto U5 snRNA almost entirely at both
low and high antibody concentrations (Figure 9B). Thus, 2B1 stimulates Sm protein
binding onto U5 snRNA and hence Sm core formation, indicating that SMN is also
involved in spliceosomal U snRNP assembly.

The data disclosed herein further demonstrate that 2B1 stimulates Sm
protein binding onto an artificial snRNA, termed SmII RNA, that consists of the Sm
site and stem/loop E of U1 snRNA and an artificial stem/loop 5' to the Sm site. The
capacity of this RNA to bind Sm protein is severely compromised, and, as a
consequence, its nuclear import is greatly reduced (Fischer et al., 1993, EMBO
J. 12:573-583). Employing the same injection strategy as described above, a strong
stimulation of Sm protein binding onto SmII RNA was observed after injection with
anti-SMN antibody 2B1, and, as a consequence, SmII was efficiently imported into the
nucleus. Taken together, and without wishing to be bound by theory, these data and
those presented in the preceding sections strongly suggest that both SIP1 and SMN are
directly involved in the assembly of the Sm core domain of spliceosomal U snRNPs.

Example 3: Specific sequences in SMN and SIP1 which mediate their
interactions with each other and with Sm proteins and which are associated with
defective interactions in SMA

The experiments presented in this example may be summarized as follows.

Deletion mutants of SMN and SIP1 were prepared and used to identify
the domains involved in the interactions between SMN and SIP1 as well as their
interactions with their associated proteins such as the Sm proteins. Further, the effects
of specific deletions and point mutations on the composition of the SMN complex and
on the cellular localization of SMN and SIP1, with particular interest in targeting to gems,
were analyzed. The data disclosed herein demonstrate that several SMN interactions
are affected by mutations that occur in some SMA patients and that SMN
oligomerization greatly enhances its interaction with Sm proteins. These results
suggest a model of the SMN complex in which a SMN/SIP1 tetramer (or a higher
oligomer) is the functional core required for efficient binding to Sm proteins, and thus
snRNP assembly. These findings further strengthen the view that SMA is the result of
a defect in snRNP metabolism.

The Materials and Methods used in the experiments presented in this
example are now described.

Plasmid Construction

DNA fragments corresponding to the open reading frames (ORFs) of
SMN and SIP1 wild-type (wt) and mutant proteins were generated by polymerase
chain reaction (PCR) amplification using suitable primers. All of the myc-tagged
constructs were generated by cloning the PCR inserts into a modified pcDNA3 vector
(InVitrogen, Carlsbad, CA) downstream from the myc epitope recognized by the
monoclonal antibody 9E10 (Siomi and Dreyfuss, 1995, J. Cell Biol. 129:551-560).
Plasmid constructs containing SMN fused to the carboxyl terminus of myc-pyruvate
kinase (PK) were obtained by cloning the SMN coding PCR fragments into a
myc-PK vector derived from pcDNA3 as previously described (Nakielny and Dreyfuss,
1996, J. Cell Biol. 134:1365-1373). Maltose binding protein (MBP)-SMN fusions
were obtained by cloning the SMN coding PCR fragments into a modified pcDNA1
vector (InVitrogen, Carlsbad, CA) downstream from the MBP sequence. These
vectors allowed expression driven by the CMV promoter in vivo and by the T7
promoter in vitro. All of the constructs were verified by DNA sequencing.

Production of proteins in vitro

The [35S]methionine-labeled proteins were produced by an in vitro
coupled transcription-translation reaction (Promega Corp., Madison, WI) in the
presence of [35S]methionine (Amersham, Arlington Heights, IL). His6-tagged SMN
and SmB fusion proteins were expressed from a pET28 bacterial expression system in
the E. coli strain BL21(DE3)pLysS and the fusion proteins were purified by Ni2+ chelation
chromatography with the Novagen (Madison, WI) His-bind Buffer Kit following the
manufacturer's protocol. All the GST fusion proteins were expressed from the GST
expression vector pGEX-5X (Pharmacia) in the E. coli strain BL21(DE3)pLysS and
were purified using glutathione-Sepharose according to the manufacturer's protocol
(Pharmacia Biotech). SmB cDNA is described in Raker et al. (1996, EMBO J.
15:2256-2269). SMN and SIP1 cDNAs were obtained using a yeast two-hybrid
screening assay previously described herein.

In vitro protein-binding assay 

Purified GST or GST fusion proteins (1-3 micrograms) were incubated
with 10^ cpm of the in vitro translated protein product and 25 µl of
glutathione-Sepharose beads in 1 ml of binding buffer (50 mM Tris HCl [pH 7.5], 200 mM NaCl, 2
mM EDTA, 0.1% NP-40, 2 micrograms per milliliter leupeptin and pepstatin A, and
0.5% aprotinin). After incubation for 1 hour at 4°C, the resin was pelleted and washed five
times with 1 ml of binding buffer, and the bound protein fraction was eluted by boiling
the washed beads in SDS-PAGE sample buffer. The eluted proteins were analyzed by
SDS-PAGE on a 12.% polyacrylamide gel, and the radiolabel signal was enhanced by
treatment with Amplify solution (Amersham, Arlington Heights, IL).

In the preincubation experiments, the indicated molar excess of purified
recombinant His-tagged SMN protein was incubated with GST or GST-SMN,
previously bound to glutathione-Sepharose beads, for 1 hour at 4°C in 1 ml of binding
buffer. Unbound proteins were removed by washing the beads five times with binding
buffer. The in vitro translated proteins were then added to the beads and binding
was performed as described previously herein.

Gel-filtration Chromatography

Purified recombinant His-tagged SMN, SMNY272C, or SMNΔEx7
(also referred to as SMNΔC16) (50 micrograms), and SmB (25 micrograms) were
incubated, individually or mixed as indicated, for 1 hour on ice in 0.25 milliliters of a
buffer containing 50 mM Hepes, pH 7.9, 400 mM KCl, 0.5 mM EDTA, 2.5 mM DTT.
The samples were then applied to a TSK-GEL G3000-SW glass column (08800;
Tosohaas, Montgomeryville, PA) that had been equilibrated in the same buffer.
One-minute fractions were collected at a flow rate of 0.25 milliliter
per minute. Selected fractions were pooled as indicated elsewhere herein,
and the samples were analyzed using SDS-PAGE and Western blotting using anti-T7
tag monoclonal antibody (Novagen, Madison, WI).
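
The fraction-collection scheme described above can be restated numerically. The following is a minimal sketch, assuming only the stated flow rate (0.25 milliliter per minute) and collection interval (one minute); the function names are hypothetical and serve solely for illustration. It shows that each fraction corresponds to 0.25 milliliter of eluate, and it gives the elution-volume window covered by a given fraction number. Column dead volume and void volume are not stated in the text and are therefore not modeled.

```python
# Illustrative arithmetic only; function names are hypothetical.
FLOW_RATE_ML_PER_MIN = 0.25   # flow rate stated in the text
FRACTION_MINUTES = 1.0        # one-minute fractions, as stated in the text


def fraction_volume_ml(flow_ml_per_min: float, minutes: float) -> float:
    """Volume of eluate collected in a single fraction."""
    return flow_ml_per_min * minutes


def fraction_window_ml(fraction_number: int) -> tuple[float, float]:
    """Elution-volume window (start, end) for a 1-based fraction number."""
    if fraction_number < 1:
        raise ValueError("fraction numbers start at 1")
    volume = fraction_volume_ml(FLOW_RATE_ML_PER_MIN, FRACTION_MINUTES)
    return ((fraction_number - 1) * volume, fraction_number * volume)


if __name__ == "__main__":
    print(fraction_volume_ml(FLOW_RATE_ML_PER_MIN, FRACTION_MINUTES))
    print(fraction_window_ml(4))
```

Under these assumptions, pooling several consecutive fractions simply sums 0.25-milliliter increments of elution volume.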



Cell culture and Immunoprecipitation 

293T cells were cultured in Dulbecco's modified Eagle's medium
(DMEM) (GIBCO BRL, Gaithersburg, MD) supplemented with 10% fetal bovine
serum (FBS) (GIBCO BRL) and transfected by the standard calcium phosphate
procedure. At 36 to 48 hours posttransfection, the cells were collected and
processed for immunoprecipitation. Immunoprecipitations were performed using total
cell lysates prepared in the presence of 0.5% Triton X-100 as described previously in
Pinol-Roma et al. (1988, Genes Dev. 2:215-227).

Immunoblotting was performed as described previously elsewhere
herein. The antibodies used for these experiments were as follows: mouse monoclonal
anti-SIP1 (2E17, described previously elsewhere herein), mouse monoclonal anti-Sm
(Y12; Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741), mouse
monoclonal anti-myc (9E10; ATCC, Manassas, VA), and mouse monoclonal anti-T7
tag (Novagen, Madison, WI).

The Results of the experiments presented in this example are now described.

SMN mutations of SMA patients affect the direct interaction of SMN
with itself and with SmB

In an in vitro binding assay, purified recombinant His-tagged SMN and
SmB proteins bound to a GST-SMN fusion protein but not to GST alone (Figure 10A).
This demonstrates that SMN interacts directly with itself and, although with a low
affinity, with SmB, and rules out a possible bridging effect by other components such as
those that may be present in the reticulocyte lysate used in experiments disclosed
previously elsewhere herein and in, for example, Lorson et al. (1998, Nature Genet.
19:63-66). Next, the effect on these interactions of two well characterized mutations
found in SMA patients, the point mutant SMNY272C and the exon 7 deletion mutant
(SMNΔEx7), was examined. Figure 10B depicts that both mutations severely affected
not only SMN self-association (Lorson et al., 1998, Nature Genet. 19:63-66) but also
the interaction of SMN with SmB. In contrast, no effect was observed on the
interaction of SMN with SIP1, which involves the amino terminus of SMN (as
disclosed previously elsewhere herein). GST-SIP1 binds equally efficiently to
full-length SMN and to both mutants. GST alone, used as a control, exhibited no
detectable binding to SMN. Because SMA is a motor neuron disease, the interactions of
the neuronal-specific Sm protein, SmN, with SMN wild type (SMNwt), SMNY272C,
and SMNΔEx7 were also analyzed and found to be identical to those of SmB.
These data demonstrate that the SMN self-association and SmB-binding domains share
common determinants within the YG box and that both SMN/SMN and SMN/SmB
interactions are affected by mutations that cause SMA. A schematic summary of
SMN-interacting domains is shown in Fig. 15A.

SMN Self-Association Enhances the Interaction with Sm Proteins.

To determine whether or not these SMN/SMN and SMN/SIP1
interactions were mutually exclusive, beads containing GST-SMN, or GST as a
control, were preincubated with a molar excess of recombinant His-SMN to form SMN
oligomers. Then, after washing away the unbound SMN, in vitro translated,
[35S]methionine-labeled SMN, SmB, or SIP1 was added to the beads and binding of the
labeled proteins was assessed (Figure 11A). SMN binding was reduced only partially
by the preincubation with recombinant His-SMN, suggesting that the oligomerization
capacity of SMN on the beads had not been saturated. Surprisingly, SmB binding was
dramatically enhanced by SMN self-association. SIP1 binding was slightly increased,
presumably because additional binding sites become available with the bound
His-SMN. The specificity of this effect is demonstrated further by the lack of binding to
control GST-bearing beads. Further, it was determined whether this effect was
exhibited with other Sm proteins known to bind SMN (e.g., as discussed previously
elsewhere herein). Fig. 11B depicts that SMN self-association greatly stimulates its
interaction with SmB, SmD1, and SmD3 but not with SmD2 and SmE. SmF and SmG
do not bind SMN under any conditions examined. The binding efficiency of Sm
proteins to GST-SMN is lower than previously disclosed elsewhere herein (e.g.,
Example 1) because of the more stringent buffer conditions employed in this study.

Several lines of evidence argue against the possibility that the increased
binding of Sm proteins merely reflects the presence of additional interaction sites on
the bound His-SMN: (i) even at the highest concentration tested, the amount of bound
His-SMN is roughly equivalent to that of GST-SMN; (ii) consistently, SMN contains a
binding site for SIP1 independent of that for Sm proteins, and SIP1 binding is only
slightly increased by the addition of His-SMN (Figure 11A); and (iii) the effect of
SMN self-association is not observed with SmD2 and SmE, whose basal binding
otherwise is similar to that of SmB, SmD1, and SmD3 (Figure 11B). These data
indicate that SMN self-association and Sm protein interaction are not mutually
exclusive but, on the contrary, that SMN self-association very strongly and specifically
increases its affinity for a subset of Sm proteins.

A titration analysis of the stimulating effect of SMN oligomerization on
SmB binding is depicted in Figure 12A and 12B. The amount of His-SMN bound to
GST-SMN immobilized on glutathione-Sepharose beads, after the preincubation step,
was determined by Western blotting. The increase in SmB binding correlated with the
extent of SMN self-association (Fig. 12A). When the bound amount of wild-type
His-SMN was equivalent to the amount of GST-SMN on the beads, SmB binding increased
approximately 10-fold (Fig. 12A, lane 4X). SmB binding increased about 30-fold, and
more than about 60% of the input was bound when the amount of His-SMN was
approximately 2.5-fold greater than GST-SMN (Fig. 12A, lane 8X). Without wishing
to be bound by theory, it appears that GST-SMN is predominantly in a monomeric
form, probably because the GST fusion or the interaction of the GST with the
glutathione-Sepharose interferes with the self-association of GST-SMN on the beads.
Consistent with their reduced ability to self-associate, a greater molar excess of
recombinant SMNY272C and SMNΔEx7 than wild-type SMN (SMNwt) was required
to obtain similar levels of association with GST-SMNwt. Although SMNY272C was
still able to stimulate SmB binding, SMNΔEx7 was not (Figure 12A). In the case of
SMNY272C self-association, the extent of stimulation of SmB binding was very low
(Figure 12B). SMNΔEx7 self-association was too inefficient to be analyzed. Thus,
SMN self-association most likely creates a high-affinity binding site for Sm proteins
and SMN mutations found in SMA patients affect the ability of SMN to form the
Sm-binding site. Moreover, the binding site formed with the SMN mutants has a lower
affinity for Sm proteins than that formed by wild-type SMN.
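
The fold-stimulation values reported for the titration can be related to one another by simple arithmetic. The following is a rough sketch, assuming (this assumption is not stated in the text) that both fold values are expressed relative to the same basal level of SmB binding to GST-SMN alone. The function name is hypothetical. Because the text reports "more than about 60%" of input bound at about 30-fold stimulation, the resulting numbers are approximate lower bounds rather than measured values.

```python
# Illustrative arithmetic only; assumes a common basal level for both fold values.
def implied_basal_percent(bound_percent: float, fold_increase: float) -> float:
    """Basal binding (% of input) implied by a bound fraction and its fold increase."""
    if fold_increase <= 0:
        raise ValueError("fold_increase must be positive")
    return bound_percent / fold_increase


BOUND_AT_8X_PERCENT = 60.0  # "more than about 60% of the input" (lane 8X)
FOLD_AT_8X = 30.0           # "about 30-fold" (lane 8X)
FOLD_AT_4X = 10.0           # "approximately 10-fold" (lane 4X)

basal_percent = implied_basal_percent(BOUND_AT_8X_PERCENT, FOLD_AT_8X)
bound_at_4x_percent = basal_percent * FOLD_AT_4X

if __name__ == "__main__":
    print(f"implied basal binding: about {basal_percent:.1f}% of input")
    print(f"implied binding in lane 4X: about {bound_at_4x_percent:.1f}% of input")
```

Under these assumptions, basal SmB binding to monomeric GST-SMN would be on the order of a few percent of input, which is in keeping with the low-affinity direct interaction described above.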

SMN Oligomerization Is Impaired in Mutants of SMA Patients and Is
Required for Binding to Sm Proteins.

Although SMN self-associates and is part of a large, macromolecular
complex in vivo, it could not be distinguished whether this is due to the presence of
multiple copies of SMN and/or of additional proteins (see, e.g., Example 1).
Moreover, previous in vitro experiments showing defective self-association of SMN
mutants (Lorson et al., 1998, Nature Genet. 19:63-66) were performed under solid-state
conditions that did not allow a distinction between dimerization and oligomerization.
Dimerization and oligomerization likely differ in terms of the interaction
surfaces required for a protein to self-associate because oligomerization would involve
at least two independent binding sites. Similarly, the data disclosed herein strongly
suggest that SMN oligomerization enhances the interaction of SMN with Sm proteins,
but no direct evidence that SMN indeed can oligomerize by itself had been provided so
far.

The ability of SMN to oligomerize was assessed using gel-filtration
chromatography of purified recombinant His-tagged SMN wild-type and mutant
proteins. Figure 13A depicts that SMN alone was able to form large oligomers of up to
a molecular mass corresponding to approximately 500 kDa. In contrast, mutants
SMNY272C and SMNΔEx7 were severely impaired in their ability to form oligomers.
Next, the predicted requirement of SMN oligomerization for Sm protein interaction
was examined. As depicted in Figure 13B, SmB associated with SMN large oligomers
in the high-molecular-weight fractions. In contrast, no association between SMNΔEx7
and SmB as larger-size complexes could be detected. These results provide direct
evidence that SMN is able to oligomerize and is found almost exclusively
self-associated in large oligomeric complexes. Furthermore, SMN mutations found in
SMA patients disrupt such oligomerization and the interaction with Sm proteins.

Reduced Association of SMN Mutants with snRNPs in Vivo. 




The data disclosed previously herein suggest that SMN mutants found
in vivo such as SMNY272C and SMNΔEx7 would associate with Sm proteins less
efficiently than SMN wild type. To test this hypothesis, 293T cells were transfected
with either myc-tagged SMN wild type or SMNY272C or SMNΔEx7. All the
myc-tagged transfected proteins were expressed at similar levels as determined by Western
blotting using antibodies against the myc tag (Figure 14). By
coimmunoprecipitation with anti-SIP1 antibodies, comparable levels of wild-type and
mutant proteins were detected associated with SIP1 as a SMN/SIP1 complex.
However, immunoprecipitation using the anti-Sm mAb Y12 demonstrated that the
association of SMNY272C and SMNΔEx7 with Sm proteins is reduced markedly
compared with the association with SMNwt. A complex comprising the SMN mutants
and Sm proteins was still detected because the reduced ability of the mutants to form
oligomers with the wild-type SMN is partially overcome by the overexpression and by
the possible contribution of other SIPs in vivo (e.g., Example 1).

Figure 15B depicts a summary model of several interactions in the SMN
complex. SMN appears to be associated with SIP1 most, if not all, of the time (see,
e.g., Example 1). Moreover, SIP1 interaction with SMN may have an effect on SMN
oligomerization or Sm protein interaction. Monomeric SMN has only a low affinity
for Sm proteins because a high-affinity Sm-binding domain forms only upon SMN
oligomerization. For simplicity, SMN has been depicted bound to Sm proteins as a
dimer; however, the actual stoichiometry of the SMN oligomers is not yet known.
Specific protein-protein interactions between the various Sm proteins are required for
the ordered assembly of the Sm core (Raker et al., 1996, EMBO J. 15:2256-2269). The
data disclosed herein demonstrate that SMN is able to form large oligomers, and, in
such a conformation, it binds with high affinity to a subset of Sm proteins. Without
wishing to be bound by any particular theory, it may be that the SMN oligomer is the
functional core that allows the SMN complex to function in snRNP assembly (e.g.,
Example 2) and spliceosome regeneration (e.g., Example 4). Importantly, SMN
mutations found in SMA patients directly affect SMN oligomerization and Sm protein
binding. Thus, the loss-of-function phenotype of mutant proteins such as SMNY272C
and SMNΔEx7 in pre-mRNA splicing, as disclosed elsewhere herein, is most likely the
direct result of an impaired interaction with the Sm proteins. These findings directly
link the molecular mechanism of SMA to a deficiency in the interaction of SMN with
spliceosomal snRNP Sm proteins. A detailed knowledge of the structure of the
SMN/SIP1 complex with Sm proteins will lead to further insights into the mechanisms
of SMN function and suggest possible therapeutic approaches for SMA.

Example 4: Novel function of SMN in pre-mRNA splicing 

The experiments presented in this example may be summarized as follows.

As discussed previously elsewhere herein, SMA is a common motor
neuron degenerative disease that results from reduced levels of, or mutations in, the
Survival of Motor Neurons (SMN) protein. SMN is found in the cytoplasm and the
nucleus where it is concentrated in gems. SMN interacts with spliceosomal snRNP
proteins and is critical for snRNP assembly in the cytoplasm. The data disclosed
herein demonstrate that a dominant-negative mutant SMN (SMNΔN27) causes a
dramatic reorganization of snRNPs in the nucleus. Furthermore, SMNΔN27 inhibits
pre-mRNA splicing in vitro, while wild-type SMN stimulates splicing. SMN mutants
found in SMA patients cannot stimulate splicing. These data demonstrate that SMN
plays a crucial role in the generation of the pre-mRNA splicing machinery and thus in
mRNA biogenesis, and the data link the function of SMN in this pathway to SMA.

The Materials and Methods used in the experiments presented in this 
example are now described. 

Constructs and recombinant proteins production 

DNA fragments corresponding to the open reading frames of SMN wild
type (SMNwt), SMNY272C, SMNΔEx7, and SMNΔN27 were generated by PCR
amplification using specific primers. In order to effect transient expression of the
inserts in HeLa cells, the inserts were cloned downstream of the CMV promoter into a
modified pcDNA3 vector (InVitrogen, Carlsbad, CA) further containing the myc-tag
reporter protein sequence corresponding to the epitope recognized by the monoclonal
antibody 9E10 (Siomi and Dreyfuss, 1995, J. Cell Biol. 129:551-560).




For the production of purified recombinant proteins, the same inserts
cloned into pcDNA3 were also cloned into the pET28 vector (Novagen Inc., Madison,
WI). His6-SMN fusion proteins were expressed in the E. coli strain BL21(DE3)pLysS
and the proteins were purified by Ni2+ chelation chromatography using the Novagen
(Madison, WI) His-bind Buffer Kit following the manufacturer's protocol. Purified
recombinant proteins were dialyzed against buffer D as described by Dignam et al.
(1983, Nucl. Acids Res. 11:1475-1489).

The Ad-2 ΔIVS construct, containing L1 exon (41 nucleotides), a
deleted form of the first intervening sequence (IVS1, 231 nucleotides) and L2 exon (72
nucleotides) of adenovirus 2 (Ad-2) major late transcription unit, was generated by
PCR amplification using pRSP-1-ΔIVS as a template (Konarska et al., 1984, Cell
38:731-736) and the construct was cloned into pcDNA3 (InVitrogen, Carlsbad, CA).
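
The segment lengths given for the Ad-2 ΔIVS construct determine the expected sizes of the splicing substrate and products. The following is a minimal sketch using only the three lengths stated above (41, 231, and 72 nucleotides); the function names are hypothetical, and any vector-derived nucleotides flanking the insert in the actual run-off transcript are not stated in the text and are therefore ignored.

```python
# Illustrative arithmetic only; ignores any vector-derived flanking sequence.
L1_EXON_NT = 41        # L1 exon length stated in the text
IVS1_DELETED_NT = 231  # deleted form of IVS1 stated in the text
L2_EXON_NT = 72        # L2 exon length stated in the text


def pre_mrna_length(exon1: int, intron: int, exon2: int) -> int:
    """Length of the unspliced substrate: exon 1 + intron + exon 2."""
    return exon1 + intron + exon2


def spliced_length(exon1: int, exon2: int) -> int:
    """Length of the ligated exons after intron removal."""
    return exon1 + exon2


if __name__ == "__main__":
    print(pre_mrna_length(L1_EXON_NT, IVS1_DELETED_NT, L2_EXON_NT))
    print(spliced_length(L1_EXON_NT, L2_EXON_NT))
```

Under these assumptions, the unspliced insert sequence is 344 nucleotides and the ligated exons are 113 nucleotides, with the excised intron accounting for the remaining 231 nucleotides.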

Cell culture and treatments 

HeLa cells were cultured in Dulbecco's modified Eagle's medium
(DMEM, GIBCO-BRL, Gaithersburg, MD) supplemented with 10% fetal calf serum
(FCS; GIBCO-BRL).

HeLa cells, plated on glass coverslips, were transfected using the
standard calcium phosphate method. Following overnight incubation with DNA, the
cells were washed and fresh medium was added. Transfected cells were then fixed and
processed for immunofluorescence staining after an additional 24-36 hours of
incubation as set forth previously herein.

Immunofluorescence microscopy

Immunofluorescence staining was carried out essentially as previously
described (Choi and Dreyfuss, 1984, J. Cell Biol. 99:1997-2004). Double-label
immunofluorescence experiments were performed by separate sequential incubations
of each primary antibody, diluted 1:1000 in PBS containing 3% (w/v) BSA, followed
by the specific secondary antibody coupled to fluorescein isothiocyanate (FITC) or Texas Red
(TXRD). All antibody incubations were performed at room temperature for 1 hour.
Laser confocal fluorescence microscopy was performed using a Leica TCS 4D
(Germany) confocal microscope. Images from each channel were recorded separately
and then the data files were merged. Antibodies used in these experiments were as
follows: rabbit polyserum anti-p80 coilin (R288, described in Andrade et al., 1993,
Proc. Natl. Acad. Sci. USA 90:1947-1951), mouse IgG1 monoclonal anti-SMN (2B1),
mouse IgG1 monoclonal anti-TMG cap (K121; Krainer, 1988, Nucleic Acids Res.
16:9415-9429) (Calbiochem, San Diego, CA), mouse IgG3 monoclonal anti-Sm (Y12;
Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741), mouse IgG1
monoclonal anti-myc (9E10), and affinity purified rabbit polyserum anti-myc (A-14,
obtained from Santa Cruz Biotech, Santa Cruz, CA).

In situ hybridization was performed following the protocol described by
Matera and Ward (1993, J. Cell Biol. 121:715-727). Briefly, the oligonucleotide probe
was a biotinylated 2'-OMe RNA complementary to U2 snRNA region 30-43
(Wassarman and Steitz, 1991, Mol. Cell. Biol. 11:3432-3445) and the biotinylated
probe was detected using streptavidin directly coupled to TXRD (GIBCO-BRL).

In vitro transcription and splicing assay

For use as templates for in vitro transcription, pSP14-15 (Ohno and
Shimura, 1996, Genes & Dev. 10:997-1007) and Ad-2 ΔIVS were linearized with
SmaI and XbaI, respectively. In vitro transcription was carried out and transcribed
RNAs were purified as described by Ohno and Shimura (1996, Genes & Dev.
10:997-1007). In vitro splicing assays in 10 µl were performed essentially as described (Ohno
and Shimura, supra) except the amount of HeLa cell nuclear extract used was reduced
by a half (13 milligrams per milliliter) in the reaction mixture. In the pre-incubation
experiments, the splicing mixture was incubated with the indicated amount of either
SMNwt, SMNY272C, SMNΔEx7, or SMNΔN27 recombinant protein for 20 minutes
at 30°C. Pre-mRNA was added and the mixture was incubated for an additional 20 or
40 minutes at 30°C. For antibody inhibition experiments, all antibodies were purified
using a protein-G Sepharose column (Pharmacia LKB Biotechnology Inc., Piscataway,
NJ) and then dialyzed against buffer D (Dignam et al., 1983, Nucl. Acids Res.
11:1475-1489). Purified antibodies were incubated with HeLa cell nuclear extract and
10X SP buffer (Ohno and Shimura, 1996, Genes & Dev. 10:997-1007) for 20 minutes
at 30°C. After addition of labeled pre-mRNA, the mixture was incubated for an
additional 20 or 40 minutes at 30°C. RNA products were analyzed by electrophoresis
on a 6% acrylamide/8.3 M urea gel for pSP14-15 RNA, and on a 10%
acrylamide/8.3 M urea gel for Ad-2 ΔIVS RNA, followed by autoradiography.

Analysis of splicing complex formation by RNP gel electrophoresis

For analysis of splicing complexes, splicing reactions were carried out
as described previously herein using α-32P-labeled chicken δ-crystallin pre-mRNA.
After a 30 minute incubation at 30°C, 50 micrograms of heparin (Sigma) were added
to each reaction and the reaction was placed on ice for 10 minutes. RNP native gel
electrophoresis on a 3.75% polyacrylamide gel was performed as described in
Konarska (1989, Methods Enzymol. 180:442-453).

The Results of the experiments presented in this example are now described.

Dominant-negative SMN mutant causes reorganization of snRNPs,
gems, and coiled bodies

To define the functional domains of SMN, various deletion mutants
were constructed and their interactions with SIP1 and the Sm proteins were examined
both in vitro by binding assays, and in vivo by co-immunoprecipitation after
transfection into mammalian cells. Moreover, the expression and cellular localization
of the myc-tagged mutants were monitored, as well as their effect on snRNP
localization. A particularly striking effect was observed upon transfection of amino
terminal deletion mutants, and one of these, a mutant lacking the first 27 amino acids
of SMN, SMNΔN27, was investigated in detail.

Double-label immunofluorescence using anti-myc-tag antibodies to
detect either the transfected myc-SMN or myc-SMNΔN27, and the anti-Sm antibody
Y12, demonstrated accumulation in the cytoplasm of Sm proteins colocalized with the
mutant SMNΔN27 in discrete aggregates (Figure 16A). In the nucleus, a striking
rearrangement of snRNPs (and possibly also some Sm proteins) was observed and they
also co-localized with SMNΔN27. In contrast, in cells transfected with wild-type
myc-tagged SMN (Figure 16A and 16C), as in untransfected cells, there was a barely
detectable signal of Sm proteins in the cytoplasm, and the nuclear staining of Sm
snRNPs demonstrated general nucleoplasmic distribution with higher local
concentrations in interchromatin granules (also referred to as "speckles"), and
particularly intense staining in coiled bodies (Carmo-Fonseca et al., 1991, EMBO J.
10:195-206; Huang and Spector, 1992, Proc. Natl. Acad. Sci. USA 89:305-308). In
these cells, SMN staining of gems was visible adjacent to and in most cases
overlapping with that of coiled bodies (Figure 16C). There was a higher diffuse
nucleoplasmic staining of myc-SMN than is normally seen in untransfected cells, likely
due to the overexpression of the protein, but the size, number and localization of gems
were unchanged relative to untransfected cells.

In the SMNΔN27 transfected cells, endogenous SMN and SIP1 were 
completely co-localized with SMNΔN27 and with snRNPs. The number of the merged 
SMN- and snRNP-containing bodies in the SMNΔN27 transfected cells was greater 
than the number of gems (or of coiled bodies) in control cells, and they were much 
larger than gems and coiled bodies. These merged structures could become as large as 
the nucleoli. 

The localization of a specific spliceosomal snRNA in cells transfected 
with SMNΔN27 was also studied. To do so, the localization of U2 snRNA was 
determined by in situ hybridization using a U2-specific antisense probe as described by 
Matera and Ward (1993, J. Cell Biol. 121:715-727). Figures 17A and 17C demonstrate 
that the cytoplasmic accumulations that stained intensely for both the SMNΔN27 and 
Sm proteins also contained U2 snRNA, while cells transfected with wild-type SMN 
exhibited only slight cytoplasmic background staining. This suggests that complexes 
containing Sm proteins, snRNAs and SMN accumulated in the cytoplasm of cells 
transfected with SMNΔN27 but not of cells transfected with SMN wt, likely 
representing a block in the pathway of cytoplasmic snRNP assembly. 

This block in cytoplasmic snRNP assembly was examined further by 
immunostaining using an anti-trimethyl-G cap (TMG)-specific antibody. The TMG cap 
is formed by hypermethylation of the 5' monomethyl G cap of the snRNAs, and this 
step has been shown to take place in the cytoplasm after Sm core assembly has 
occurred (Mattaj, 1986, Cell 46:905-911; Fischer and Lührmann, 1990, Science 
249:786-790). The immunostaining indicated that although the cytoplasmic 
accumulations caused by SMNΔN27 contained U2 as well as Sm proteins and SMN, 
the snRNAs within them did not receive the TMG modification (Figure 17D and 17F). 
Although the possibility that lack of TMG immunostaining in the cytoplasm is a result 
of masking cannot be excluded, the complete absence of signal and the strong staining 
in the nucleus make this unlikely. Thus, without wishing to be bound by theory, it 
appears that the snRNP assembly pathway in the cytoplasm is arrested at a step 
preceding the cap hypermethylation. The snRNAs in the nuclei of the same cells, 
however, are hypermethylated, since they stain efficiently with the anti-TMG antibody. 

These effects of SMNΔN27 suggest that the construct has a dominant-negative 
phenotype over wild type SMN. These data also suggest that SMN and SIP1 
(Gemin2) interact with Sm proteins in the cytoplasm, and, importantly, also with 
snRNPs in the nucleus. Further, these data demonstrate that mutations in SMN can 
have profound effects on both the biogenesis and localization of spliceosomal snRNPs 
in the cell. 

Gems, the structures in which SMN and SIP1 are most highly 
concentrated in the nucleus, are usually found adjacent to and often merged with coiled 
bodies (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565; Liu et al., 1997, Cell 
90:1013-1021). Therefore, it was determined whether the SMN mutant, SMNΔN27, 
had any effect on the structure or organization of coiled bodies by staining the cells 
with antibodies to p80 coilin, a coiled body-specific marker (Andrade et al., 1991, J. 
Exp. Med. 173:1407-1419). The immunomicrographs demonstrate 
that coiled bodies and gems became completely merged in the nucleus and that the p80 
coilin staining completely coincided with the staining of the enlarged gems (Figure 
18A-F). These enlarged merged nuclear structures therefore contained snRNPs, coiled 
body components, and gem components. The reorganization of coiled bodies that SMNΔN27 
brings about suggests a functional relationship between SMN and coiled bodies. 

SMNΔN27 inhibits pre-mRNA splicing when added during preincubation 

The profound effect of SMN on the organization of nuclear 
snRNPs, particularly the formation of large snRNP-containing aggregates which are 
similar in appearance to those observed in cells in which snRNPs are inactivated by 
antisense oligonucleotides (O'Keefe et al., 1994, J. Cell Biol. 124:249-260) or when 
transcription is inhibited by actinomycin D (Zeng et al., 1997, EMBO J. 16:1401- 
1412), suggests that SMNΔN27 can cause an inactivation of snRNPs and, possibly, of 
other pre-mRNA splicing factors. To address this directly, the effect of SMNΔN27 on 
pre-mRNA splicing was examined in an in vitro system. 

SMN wt and SMNΔN27 were produced in bacteria as recombinant 
proteins bearing a His-tag; the fusion proteins were purified to homogeneity and then 
added to nuclear splicing extracts. 32P-labeled chicken δ-crystallin 
pre-mRNA was used as a splicing probe and the reaction products were analyzed by gel 
electrophoresis. Addition of SMN wt or the SMNΔN27 mutant fusion protein to the 
reaction at time zero had almost no effect on the splicing reaction, although both fusion 
proteins appeared to stabilize the intron (Figure 19A). 

It was next determined whether SMN, while not a splicing factor per se, 
may be important for some function akin to its role in snRNP assembly in the 
cytoplasm. That is, although nuclear snRNPs are thought to be stable and fully 
assembled, they may need to be regenerated or reassembled between rounds of 
splicing, and SMN may be required for this process. To address this possibility, the 
splicing extract was allowed to run in the presence of SMNΔN27 before the labeled 
pre-mRNA probe was added. Specifically, the extract was pre-incubated with all the 
components, including an energy generating system and SMN wt or SMNΔN27, but 
without the pre-mRNA, for 20 minutes, at which time the δ-crystallin pre-mRNA 
probe was added and the reaction was allowed to proceed for an 
additional 20 or 40 minutes. A strong inhibition of splicing was 
observed only in the samples to which SMNΔN27 was added during 
the pre-incubation period (Figure 19B). Identical amounts of SMN wt 
and of SMNΔN27 were used in the splicing reactions, and the inhibition 
demonstrated in samples preincubated with SMNΔN27 was 
concentration-dependent. About a three- to five-fold mass excess of 
SMNΔN27 over endogenous SMN was sufficient for complete inhibition 
of δ-crystallin pre-mRNA splicing. 

SMNΔN27 blocks the formation of spliceosomal complex C and 
SMNwt stimulates spliceosome formation when added during preincubation 

The observation that preincubation with SMNΔN27 causes inhibition of 
splicing, including a block to the first step of the pre-mRNA splicing reaction (cleavage 
at the 5' splice site and lariat intron formation), indicated a block to an early step in 
spliceosome formation. In order to characterize this block in greater detail and to 
determine whether the snRNPs could interact with the pre-mRNA at all, the RNP 
complexes were analyzed by native gel electrophoresis (Konarska, 1989, Methods 
Enzymol. 180:442-453). Immediately upon addition to a nuclear extract, the 
pre-mRNA rapidly forms an ATP-independent complex with endogenous RNA-binding 
hnRNP proteins, termed H complex (Figure 20, lane 1). With further incubation at 
30°C in the presence of ATP, several larger complexes are observed (Figure 20, lane 
2), which are designated A, B, and C complex (Figure 23; see also Ohno and Shimura, 
1996, Genes & Dev. 10:997-1007). Consistent with the notion that allowing the 
extract to preincubate under splicing conditions leads to a reduction in active 
components, the capacity of the extract to form splicing complexes was significantly 
reduced during a 20 minute preincubation (Figure 20, lane 3). However, SMNwt 
prevented this reduction so that the extract retained and in fact exhibited increased 
efficiency of splicing complex formation (Figure 20, lane 4). In contrast, although 
extracts to which SMNΔN27 has been added show a higher capacity to form A and B 
complexes compared with untreated extract, the formation of the C complex, which is 
the mature spliceosome, is completely inhibited (Figure 20, lane 5). These results 
suggest that SMN has an important role in maintaining snRNPs in active form and that 
the amino-terminal 27 amino acids of the protein are critical for this process. 

Antibodies to the amino terminus of SMN inhibit and antibodies to the 
carboxyl terminus stimulate pre-mRNA splicing when added during preincubation 

Amino terminal deletions in the SMN gene, or other dominant-negative 
mutations in SMN, have not been found so far in SMA patients, who, in the vast 
majority of cases, exhibit only a dramatic reduction in the level of the wild-type SMN 
protein. To mimic this situation and to address the role of SMN in pre-mRNA splicing, 
the available amount of the SMN protein was reduced without the addition of an SMN 
mutant. 

SMN is tightly associated with SIP1, this interaction being resistant to 
dissociation with 1 M NaCl, and SMN is also associated with other proteins, thus 
precluding the use of immunodepletion as a way of removing exclusively SMN from 
the extract. Therefore, purified anti-SMN monoclonal antibody (2B1) was added to the 
extract to determine whether anti-SMN could cause specific immunoinhibition of 
pre-mRNA splicing. The results, shown in Figure 21, demonstrate that 2B1 strongly 
inhibited splicing but, again, only if it was added during the pre-incubation period. The 
lack of inhibition seen when the antibody was added at the same time as the probe 
pre-mRNA indicated that 2B1 was not simply toxic to the reaction. Consistent with the 
dominant-negative effect of SMNΔN27, the first 27 amino acids of SMN are part of 
the epitope recognized by 2B1. The specificity of the 2B1 effect was illustrated by the 
lack of inhibition by the control antibody SP2/0, and further by the lack of inhibition 
by an affinity purified rabbit polyclonal antibody raised against an SMN peptide 
corresponding to the extreme C-terminus encoded by exon 7. In fact, the anti-exon 7 
antibody had a strong stimulatory effect. Further, the data demonstrate that anti-SMN 
(2B1) was as strong an inhibitor of pre-mRNA splicing as the anti-snRNP antibody 
Y12 (Figure 22; Padgett et al., 1983, Cell 35:101-107). 

Unlike SMNwt, SMN mutants found in SMA patients do not stimulate 
splicing when added during preincubation 

In addition to the homozygous deletion of both telomeric SMN genes, 
several point mutations and partial deletions of the carboxyl terminus of SMN have 
been identified in SMA patients, as reviewed in Burghes (1997, Am. J. Hum. Genet. 
61:9-15). The effects of two mutations leading to the severe type I SMA phenotype, 
the Y272C point mutation and the deletion of the amino acid sequence encoded by 
exon 7 (Lefebvre et al., 1995, Cell 80:155-165; Burghes, 1997, Am. J. Hum. Genet. 
61:9-15), have been investigated. SMN deleted of exon 7 (designated SMNΔEx7) is 
also thought to be the main form produced by the centromeric SMN gene (Gennarelli et 
al., 1995, Biochem. Biophys. Res. Commun. 213:342-348). These loss-of-function 
mutations of SMN are recessive and result in the SMA phenotype in individuals 
carrying a deletion of the other telomeric SMN allele. As these are recessive mutations, 
they were not expected to produce the inhibitory effects of SMNΔN27. Nonetheless, it 
was determined whether these recessive mutants could substitute for SMNwt in 
providing a stimulatory effect during preincubation. These experiments were 
performed using the adenovirus 2 major late transcription unit-derived pre-mRNA, 
Ad-2 ΔIVS (Konarska et al., 1984, Cell 38:731-736), rather than the δ-crystallin 
pre-mRNA, in order to also determine whether the inhibitory effect of 
SMNΔN27 was a general phenomenon, as opposed to an effect that is unique to 
δ-crystallin pre-mRNA. 

Similar to its effect on δ-crystallin pre-mRNA splicing, addition of 
SMNΔN27 during the pre-incubation time inhibited the splicing of Ad-2 ΔIVS 
pre-mRNA (Figure 22). In contrast, addition of the wild-type SMN had a strong 
stimulatory effect on adenovirus pre-mRNA splicing. Neither SMNY272C nor 
SMNΔEx7 had any detectable effect, demonstrating neither stimulation nor inhibition 
of pre-mRNA splicing. The recombinant proteins used in these experiments were 
produced side-by-side using the same procedure, and identical amounts were added to 
each assay. All these SMN recombinant proteins were able to directly bind a 
GST-SIP1 fusion protein in vitro, further suggesting that the recombinant proteins were 
purified in native form. These findings demonstrate that SMN has an important and 
unexpected general role in pre-mRNA splicing, and these data demonstrate a direct 
connection between the molecular defect of SMA and the pre-mRNA splicing cycle. 

The data disclosed herein demonstrate that SMN has functions both in 
the cytoplasm and in the nucleus, in both snRNP biogenesis and function. 
Experiments in Xenopus oocytes disclosed previously elsewhere herein demonstrated a 
role for SIP1 in the biogenesis of snRNPs by serving as an assembly factor for the 
snRNAs and the snRNP Sm core proteins. Those experiments, however, did not reveal 
as clear and direct a role for SMN as for SIP1, because while microinjections of 
anti-SIP1 antibodies strongly inhibited snRNP assembly, the anti-SMN antibodies had a 
stimulatory effect (see Example 2). The experiments in somatic cells reported herein 
demonstrate that transfection of an SMN mutant blocks snRNP assembly in the 
cytoplasm, indicating that SMN, like SIP1, is critical for this process. The data 
disclosed herein further demonstrate that the requirement for SMN in snRNP assembly 
is a general one and not a phenomenon that is unique to amphibian oocytes. 

Although the presence of SMN and SIP1 in the nucleus and their high 
concentration in gems, adjacent to and often merged with the snRNP-rich coiled 
bodies, suggested a function for these proteins in the activity of snRNPs in the nucleus, 
the strong effect of SMNΔN27 on nuclear snRNP organization was unexpected. As 
this raised the possibility of an involvement of these proteins, particularly of SMN, in 
the activity of snRNPs in pre-mRNA splicing, this possibility was examined. The data 
disclosed herein point to a novel function of SMN in the pre-mRNA splicing cycle and 
suggest that some components of the spliceosome, likely snRNPs but possibly also 
other components, require SMN for their function. The lack of an effect of SMNΔN27 
upon its addition to a splicing extract at time zero indicates that SMN is neither a 
general inhibitor of splicing nor a splicing factor per se. Rather, inhibition of splicing 
by SMNΔN27 and by the anti-SMN antibody 2B1 is seen only if the mutant or the 
antibody is added to the extract and the reaction is preincubated before the pre-mRNA 
probe is added. This inhibition, which is not observed with wild-type SMN and which is 
general rather than pre-mRNA specific, suggests a role for SMN in regeneration of 
snRNPs (and possibly also of other components). Without wishing to be bound by any 
particular theory, the data disclosed herein suggest that during the incubation of the 
nuclear extract under splicing conditions, including an energy-regenerating system, 
some of its components, likely including snRNPs, become inactivated and their 
regeneration to functional form requires SMN. Indeed, addition of recombinant 
SMNwt greatly stimulates both spliceosome formation and splicing. In splicing 
competent extracts, endogenous SMN is sufficient to provide this activity, but addition 
of SMNΔN27 causes a block to this regeneration process and acts as a 
dominant-negative mutant of SMN because it is able to engage in some of the same 
interactions as SMN but the mutant protein traps complexes in a nonfunctional state. The 
interactions of SMNΔN27 with Sm proteins, with SIP1, and with SMN are 
indistinguishable from those of wild-type SMN. SMN mutations found in SMA patients 
do not show either a stimulation or an inhibition of splicing, confirming that they 
represent loss-of-function recessive mutants. Since SMNY272C and SMNΔEx7 have 
a highly reduced oligomerization capacity (Lorson et al., 1998, Nature Genet. 19:63- 
66), SMN oligomerization may be required for its splicing-regenerating activity. It 
should be noted that although the general term "regeneration" is used to describe this 
function of SMN, the actual mechanism involved is not yet known and the present 
invention is not limited to any particular mechanism of SMN action. 

Several recycling factors that are essential for splicing, mostly 
DEAD/DEAH box RNA helicases, have been described previously. Recycling factors, 
exemplified by S. cerevisiae Prp22 and Prp43, function in the disassembly of snRNPs, 
splicing factors, intron lariat, and spliced mRNA from the spliceosome (Staley and 
Guthrie, 1998, Cell 92:315-326). For example, Prp22 is needed for releasing the 
mRNA from the spliceosome and therefore yeast cells that are made deficient in 
functional Prp22 accumulate mRNA on spliceosomes in the nucleus but there is no 
block in pre-mRNA splicing (Company et al., 1991, Nature 349:487-493). Similarly, 
deficiency in Prp43, a recycling factor required for disassembly of the U snRNP-intron 
lariat complex, does not result in inhibition of splicing (Arenas and Abelson, 1997, 
Proc. Natl. Acad. Sci. USA 94:11798-11802). Splicing extracts preincubated with 
SMNΔN27 do not show any cleavage of the pre-mRNA, indicating that even the first 
step in the splicing reaction, i.e., the cleavage at the 5' splice site and the concomitant 
formation of the intron lariat, has not taken place. This is somewhat similar to what is 
seen upon depletion of Prp24, the recycling factor required for re-annealing of U4 and 
U6 snRNPs (Raghunathan and Guthrie, 1998, Science 279:857-860). As disclosed 
herein, the RNP gel analysis suggests that there is a deficiency in functional 
components that are required for spliceosome complex C formation. Future 
experiments will characterize the block to C complex formation and attempt to define 
the specific defect that results from incubation of the extract with SMNΔN27 or from 
reduced levels of SMN. Further, unlike recycling factors described so far, SMN and 
SIP1 do not contain DEAD/DEAH motifs. However, as disclosed previously 
elsewhere herein, additional proteins are found in the SMN/SIP1 complex, and it is 
possible that one of these proteins has such an activity. Indeed, the data disclosed 
elsewhere herein demonstrate that a novel SMN-interacting protein, Gemin3, contains 
a DEAD/DEAH motif (see Example 6). 

Without wishing to be bound by any particular theory, the data 
disclosed herein demonstrate that the factors requiring regeneration, presumably 
snRNPs, must exist in two states, inactive and active, and that the conversion of 
inactive to active forms requires SMN. The conversion of active to inactive may be a 
consequence of their function in splicing or it may be an intrinsic switch that they 
undergo independent of splicing. By analogy to the function of SMN and SIP1 in the 
cytoplasm, it may be that snRNPs in the nucleus, although thought of as stable, fully 
assembled, and functional RNPs, in fact undergo some disassembly or rearrangements, 
and SMN and SIP1 are required for their reassembly to regenerate them into functional 
form. A scheme depicting this proposed view of the place of SMN in the pre-mRNA 
splicing cycle is shown in Figure 23. The nuclear functions of SIP1 are not yet known 
and are being investigated. 

Coiled bodies were first described in 1903 by Ramón y Cajal, who 
observed them in neuronal cells and named them nucleolar accessory bodies (Ramón y 
Cajal, 1903, Trab. Lab. Invest. Biol. 2:129-221). Over the past several years, coiled 
bodies have received much renewed interest as they have been found to contain the 
highest concentration of snRNPs in the nucleus, and additional data have raised the 
possibility that they have important, albeit not clearly defined, roles in the formation of 
both splicing components and the nucleolus (Raska et al., 1990, J. Struct. Biol. 
104:120-127; Lamond and Carmo-Fonseca, 1993, Trends Cell Biol. 3:198-204; 
Bohmann et al., 1995, J. Cell Biol. 131:817-831; Roth, 1995, Curr. Opin. Cell Biol. 
7:325-328; Lamond and Earnshaw, 1998, Science 280:547-553). The effect of 
SMNΔN27 on coiled bodies provides insight into the function of both gems and coiled 
bodies. It indicates that there is a functional relationship between these two structures 
and, in particular, that SMN affects the organization, and likely the function, of coiled 
bodies. It is possible that gems and coiled bodies represent two stations in a pathway 
along which snRNPs need to proceed to become functional. At this stage, the order of 
steps between these stations cannot be assigned. The data disclosed herein directly link 
the functions of coiled bodies and gems to pre-mRNA splicing components, 
specifically to a regeneration of snRNPs. 

The functions disclosed for the first time herein for SMN and SIP1 
define them as critical proteins for the generation of the splicing machinery and thus 
ultimately for the process of mRNA biogenesis. Consistent with a housekeeping 
function for SMN, SMN knockout mice display an early embryonic lethal phenotype 
(Schrank et al., 1997, Proc. Natl. Acad. Sci. USA 94:9920-9925). Most SMA patients, 
particularly those with the fatal infant form, SMA type I, have a drastic reduction in the 
amount of SMN in motor neurons (Lefebvre et al., 1997, Nature Genet. 16:265-269). 
In some cases, SMA patients produce a protein bearing a recessive mutation like the 
SMNY272C and SMNΔEx7 analyzed here. It is therefore important that, as disclosed 
herein, these mutants do not have the capacity to provide the activity of wild-type SMN 
in splicing. Together, the novel findings presented here strongly suggest that motor 
neurons of SMA patients are impaired in their capacity to produce mRNAs and, as a 
result, become deficient in proteins that are necessary for the growth and functions of 
these cells. 

In conclusion, SMA is a human splicing disease, and better knowledge 
of the precise molecular details of the mechanism of action of SMN in this process will 
likely facilitate the search for a therapeutic approach to this devastating disease. 
Although SMA is considered to be an autosomal recessive disease, the phenotype of 
SMNΔN27 demonstrates that some mutations in SMN will have a dominant-negative 
lethal phenotype. 

Example 5: Gemin3: A novel DEAD box protein that is a component 
of gems and that interacts with SMN 

The experiments presented in this example may be summarized as follows. 

The data disclosed herein demonstrate the cloning and characterization 
of a heretofore unknown protein that is a component of the SMN complex, a novel 
DEAD box putative RNA helicase, designated "Gemin3" (for protein component of 
Gems number 3). Gemin3 interacts directly with SMN as well as with SmB, SmD2 
and SmD3. Immunolocalization studies using monoclonal antibodies to Gemin3 
demonstrated that Gemin3 co-localizes with SMN in gems. Gemin3 binds to SMN 
via its unique C-terminal domain, and SMN mutations found in some SMA patients 
strongly reduce this interaction. Unlike SMN and SIP1, which do not contain 
DEAD/DEAH motifs (reviewed in Staley and Guthrie, 1998, Cell 92:315-326), 
Gemin3 contains a DEAD box motif, indicating that the protein is a putative helicase. 
Thus, the presence of a DEAD motif in Gemin3 suggests that this protein may provide 
a catalytic activity that plays a critical role in the function of the SMN complex on 
RNPs. 

The data disclosed herein further demonstrate the production of 
monoclonal antibodies to Gemin3. In addition, immunofluorescence microscopy 
demonstrated that Gemin3 co-localizes with SMN in gems. Like SMN and SIP1, 
Gemin3 can be isolated in a complex with several spliceosomal snRNP proteins. 
Moreover, the data disclosed demonstrate that Gemin3 interacts directly with SMN and 
with several of the spliceosomal snRNP core Sm proteins including the B and D2-3 
proteins. The unique C-terminal domain of Gemin3 mediates interaction with SMN 
and localization of Gemin3 to gems. The discovery of a DEAD box protein (i.e., a 
likely RNA helicase) in the SMN complex is of particular interest as the functions 
revealed so far suggest that this complex has crucial activities in the biogenesis of 
RNPs. To perform such functions, including assembly of the snRNPs and the 
regeneration of active components of the spliceosome, the SMN complex may effect 
structural changes in its RNP targets. Of the known components of the SMN complex, 
the DEAD box protein Gemin3 is the most likely protein to have the capacity to 
perform such a function. Importantly, SMN proteins with mutations found in SMA 
patients exhibited significantly reduced interaction with Gemin3, suggesting that the 
SMN complexes in these patients are deficient in this protein. 

The Materials and Methods used in the experiments presented in this 
example are now described. 

Identification of p105 protein by mass spectrometry 

The p105 protein was co-immunoprecipitated with anti-SMN 
monoclonal antibody 2B1 and the band was excised from a single one-dimensional 
Coomassie stained polyacrylamide gel. The protein band was digested with trypsin 
(unmodified trypsin, sequencing grade, Boehringer Mannheim, Indianapolis, IN) in-gel 
as described in Shevchenko et al. (1996, Anal. Chem. 68:850-858). Tryptic peptides 
were extracted and recovered from the gel using 5% formic acid and acetonitrile. The 
extracts were pooled, dried in a speed vacuum, and the dried pellets 
were redissolved in 5% formic acid. The peptides were analyzed by nanoelectrospray 
tandem mass spectrometry as described in Wilm and Mann (1996, Anal. Chem. 68:1-8). 
Nano ES MS/MS was performed on an API III triple quadrupole instrument (PE Sciex, 
Ontario, Canada) equipped with a nanoelectrospray ion source developed at EMBL 
(Wilm and Mann, 1996, Anal. Chem. 68:1-8). 

Comprehensive protein and EST databases were searched using 
PeptideSearch version 3.0 software. No limitations on protein molecular weight or 
species of origin were imposed. 

Production of proteins in vitro 

[35S]methionine-labeled proteins were produced by an in vitro coupled 
transcription-translation reaction (Promega Biotech, Madison, WI) in the presence of 
[35S]methionine (Amersham, Arlington Heights, IL). Gemin3 and SMN fusion 
proteins, comprising a tag polypeptide comprising a run of histidine amino acid 
residues (6His) to produce 6His-Gemin3 and 6His-SMN fusion proteins, were expressed 
from a pET bacterial expression system in the E. coli strain BL21(DE3) and the fusion 
proteins were purified by nickel chelation chromatography using a kit (His-Bind 
buffer kit; Novagen, Madison, WI) per the manufacturer's instructions. 

Gemin3 fusion protein comprising a glutathione-S-transferase (GST) tag 
polypeptide (i.e., GST-Gemin3) was expressed using a GST expression vector 
(pGEX-5X-3; Pharmacia LKB Biotechnology Inc., Piscataway, NJ) in the E. coli strain BL21. 
The GST-Gemin3 fusion protein was purified using glutathione-Sepharose (Pharmacia 
LKB Biotechnology Inc., Piscataway, NJ) according to the manufacturer's protocol. 

Production of monoclonal antibodies that specifically bind Gemin3 

Anti-Gemin3 antibodies, designated 11G9 and 12H12, were prepared by 
immunizing BALB/c mice with the 6His-tagged C-terminal domain of Gemin3 (comprising 
from about amino acid residue number 368 to about 548) purified by nickel chelation 
chromatography using a Novagen (Madison, WI) His-Bind buffer kit per the 
manufacturer's instructions. Hybridoma production, screening and ascites fluid 
production were performed as described previously by Choi and Dreyfuss (1984, J. 
Cell. Biol. 99:1997-2004). 

Immunoprecipitation and immunoblotting 

Immunoprecipitations of in vitro translated proteins were carried out in 
the presence of 1% Empigen BB buffer as previously described (Choi and Dreyfuss, 
1984, J. Cell. Biol. 99:1997-2004). Immunoprecipitations of SMN, the Sm proteins 
and Gemin3 from cells were carried out using total HeLa cell lysate in the presence of 
1% Empigen BB buffer as previously described (Choi and Dreyfuss, 1984, supra). 
Immunoprecipitations and purifications of the SMN, Gemin2, Sm and Gemin3 
complexes were carried out using total HeLa cell lysate in the presence of 0.5% 
Triton X-100 as previously described in Piñol-Roma et al. (1988, Genes Dev. 2:215- 
227). 

For immunoblotting, the HeLa cell lysate proteins were resolved using 
12.5% SDS-polyacrylamide gels and the proteins were then transferred to a 
nitrocellulose membrane (Schleicher and Schuell, Inc., Keene, NH) using a BioTrans 
Model B Transblot apparatus (Gelman Science) according to the manufacturer's 
instructions. The membranes were then incubated in blocking solution 
(phosphate-buffered saline, PBS, containing 5% (w/v) nonfat milk) for at least 1 hour at room 
temperature. The membranes were rinsed with cold PBS, and then were incubated in 
blocking solution containing a selected primary antibody for at least 1 hour at room 
temperature. The membranes were subsequently washed three times in PBS containing 
0.05% (v/v) NP-40, and any bound antibodies were detected using 
peroxidase-conjugated goat anti-mouse IgG plus IgM (Jackson Immunoresearch Laboratories, 
West Grove, PA). The antibody-decorated protein bands were visualized using an 
enhanced chemiluminescence (ECL) Western blotting kit (Amersham, Arlington 
Heights, IL) after washing the membranes an additional three times with PBS 
containing 0.05% (v/v) NP-40. 

Cell culture and treatments 

HeLa cells were cultured in Dulbecco's modified Eagle's medium 
(DMEM; GIBCO BRL, Gaithersburg, MD) supplemented with 10% (v/v) fetal bovine 
serum (FBS; GIBCO BRL, Gaithersburg, MD). 

Immunofluorescence microscopy 

Immunofluorescence staining was carried out essentially as described 
previously in Choi and Dreyfuss (1984, J. Cell. Biol. 99:1997-2004). Double-label 
immunofluorescence experiments were performed using separate, sequential 
incubations with each primary antibody, each diluted in PBS containing 3% (w/v) bovine 
serum albumin (BSA) followed by incubation using the specific secondary antibody 
coupled to either fluorescein isothiocyanate (FITC) or Texas Red. All incubations in 
antibodies were carried out at room temperature for 1 hour. Laser confocal 
fluorescence microscopy was performed using a Leica Model TCS 4D confocal 
microscope (Leica, Inc., Exton, PA). Images from each channel were recorded 
separately and stored in separate data files. Subsequently, where indicated, the 
separate data files were merged. 

The antibodies used in these experiments were as follows: mouse IgG1 
monoclonal anti-Gemin3 (11G9 and 12H12); mouse IgG1 monoclonal anti-SMN 
(2B1); rabbit polyserum anti-p80 coilin (R288, Andrade et al., 1993, Proc. Natl. Acad. 
Sci. USA 90:1947-1951); mouse IgG3 monoclonal anti-Sm (Y12, Lerner et al., 1981, 
Proc. Natl. Acad. Sci. USA 78:2737-2741); and SP2/0, a mouse hybridoma that does 
not secrete immunoglobulin chains (American Type Culture Collection [ATCC], Rockville, 
MD). The rabbit affinity purified anti-Exon 7 antibody is described elsewhere herein. 

In vitro protein-binding assay 

Purified GST or GST-fusion proteins (2 micrograms) bound to 25 microliters of
glutathione-Sepharose beads were incubated with 10^ cpm of the in vitro translated
protein mixture in 1 ml of binding buffer (50 mM Tris-HCl [pH 7.5], 200 mM NaCl, 2
mM EDTA, 0.1% NP-40, 2 micrograms per milliliter leupeptin and pepstatin A, and
0.5% aprotinin). Following incubation for 1 hour at 4°C, the resin was washed five
times with 1 ml of binding buffer. The bound protein fraction was eluted by boiling
the beads in SDS-PAGE sample buffer, and the eluted proteins were run on SDS-
PAGE as described previously elsewhere herein. The gels were fixed for 30 minutes
and the radiolabeled signal was enhanced by treating the gels with Amplify solution
(Amersham, Arlington Heights, IL).
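
As an illustration only, the binding buffer composition given above (50 mM
Tris-HCl, 200 mM NaCl, 2 mM EDTA, 0.1% NP-40) can be converted into pipetting
volumes using the dilution relation C1 x V1 = C2 x V2. The sketch below assumes
hypothetical stock solutions (1 M Tris-HCl, 5 M NaCl, 0.5 M EDTA, 10% NP-40) that
are not specified herein, and it omits the protease inhibitors.

```python
# Final concentrations from the binding buffer described above.
FINAL = {"Tris-HCl (mM)": 50.0, "NaCl (mM)": 200.0, "EDTA (mM)": 2.0, "NP-40 (%)": 0.1}

# Hypothetical stock concentrations (assumptions, not taken from the text).
STOCKS = {"Tris-HCl (mM)": 1000.0, "NaCl (mM)": 5000.0, "EDTA (mM)": 500.0, "NP-40 (%)": 10.0}


def stock_volumes(final, stocks, total_ml):
    """Return the ml of each stock needed for total_ml of buffer (C1*V1 = C2*V2)."""
    volumes = {}
    for name, c_final in final.items():
        c_stock = stocks[name]
        if c_final > c_stock:
            raise ValueError(f"stock for {name} is more dilute than the target")
        volumes[name] = c_final * total_ml / c_stock
    # Whatever volume remains is made up with water.
    volumes["water"] = total_ml - sum(volumes.values())
    return volumes


if __name__ == "__main__":
    for name, ml in stock_volumes(FINAL, STOCKS, 100.0).items():
        print(f"{name}: {ml:.2f} ml")
```

The same function may be reused for the lower-salt buffer used for direct
binding by changing the NaCl and NP-40 entries of the target dictionary.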

For direct in vitro binding, purified GST or GST-Gemin3 proteins (2
micrograms) bound to 25 microliters of glutathione-Sepharose beads were incubated
with 5 micrograms of purified 6His-tag-SMN or 6His-tag-SmB in 1 ml of binding
buffer (50 mM Tris-HCl [pH 7.5], 100 mM NaCl, 2 mM EDTA, 0.05% NP-40, 2
micrograms per milliliter leupeptin and pepstatin A, and 0.5% aprotinin). After
incubation for 1 hour at 4°C, the beads were washed five times with 1 ml of binding
buffer per wash. The bound protein fraction was eluted by boiling the beads in SDS-
PAGE sample buffer, and any 6His-SMN and/or 6His-SmB proteins eluted from the
beads were analyzed using SDS-PAGE and Western blot using a rabbit polyclonal anti-
His-tag antibody (Santa Cruz Biotech., Santa Cruz, CA).

Cell fractionation and chromatography

HeLa cells were fractionated as described in Dignam et al. (1983, Nucl.
Acids Res. 11:1475-1489). S100 fractions (400 microliters comprising about 20
milligrams per milliliter protein) in buffer F (20 mM Tris-HCl [pH 7.4], 0.1 mM
EDTA, 1 mM DTT, 10% glycerol, 500 mM KCl) were loaded on a Superose 6 HR
10/30 column (Pharmacia LKB Biotechnology Inc., Piscataway, NJ). The column was
then washed with buffer A (20 mM Tris-HCl [pH 7.4], 200 mM NaCl, 2.5% glycerol).
Selected protein fractions (0.5 ml) were collected, and 30 microliters of each fraction was
resolved on SDS-PAGE followed by Western blotting.
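
The quantities in the preceding paragraph imply a simple piece of arithmetic,
sketched here for convenience. The function names are illustrative, and the
aliquot is assumed to be 30 microliters taken from each 0.5 ml fraction.

```python
def protein_loaded_mg(volume_ul, conc_mg_per_ml):
    """Total protein applied to the column, in milligrams."""
    return volume_ul / 1000.0 * conc_mg_per_ml


def fraction_analyzed(aliquot_ul, fraction_ml):
    """Share of each collected fraction that is run on the gel (0 to 1)."""
    return aliquot_ul / (fraction_ml * 1000.0)


if __name__ == "__main__":
    # 400 microliters of S100 at about 20 mg/ml.
    print(f"protein loaded: {protein_loaded_mg(400, 20):.1f} mg")
    # 30 microliters (assumed) from each 0.5 ml fraction.
    print(f"share of fraction on gel: {fraction_analyzed(30, 0.5):.1%}")
```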

The results of the experiments presented in this example are now described.

Gemin3, a novel SMN-interacting protein with a DEAD box RNA helicase domain

Immunoprecipitations from [35S]methionine-labeled HeLa cell lysates
with anti-SMN and anti-SIP1 monoclonal antibodies disclosed the presence of several
protein components in the SMN-SIP1 complex (Liu et al., 1997, Cell 90:1013-1021).
Among the proteins that can be co-immunopurified with anti-SMN and anti-SIP1
antibodies, only some of the major low molecular mass proteins, identified as the Sm
proteins, have so far been characterized as disclosed elsewhere herein. In addition to
SMN, SIP1, and the Sm proteins, there is a doublet at about 97 kDa and additional
bands at 175 kDa, 95 kDa, 60 kDa and 50 kDa that co-immunopurified with the anti-
SMN antibody. The two proteins of the 97 kDa doublet were eluted from the gel,
digested with trypsin, and the resulting peptides were sequenced using
nanoelectrospray mass spectrometry as described previously (Wilm et al., 1996, Anal.
Chem. 66:1-8; Shevchenko et al., 1996, Anal. Chemistry 68:850-858). The data
disclosed herein demonstrate the molecular cloning and characterization of the high
molecular weight protein of this doublet (i.e., p105). Several peptides from this band
identified a human EST sequence (clone #AA303940) using the peptide sequence tag
algorithm (Figure 2A and 2B). Several additional cDNA clones were obtained by
hybridization screening of a human leukemia 5'-STRETCH PLUS cDNA library using
this EST clone as a probe. Twelve independent partial cDNA clones with insert sizes
ranging from about 1 to 2.5 kb, all of which contained overlapping regions of the same
open reading frame (ORF), were isolated. 5' RACE PCR was used to extend this
cDNA further upstream. A cDNA clone containing the longest ORF was constructed
and conceptual translation of its nucleotide sequence disclosed a potential initiator
methionine preceded by an in-frame stop codon. This cDNA encodes a putative
protein of 824 amino acids with a calculated molecular mass of 92.2 kDa and a pI of
6.5. Next, it was determined that this cDNA encodes the p105 protein that
co-immunoprecipitates with SMN. Thus, this is a full-length cDNA clone (SEQ ID NO:1)
(GenBank accession number AF171063) for a novel component of the SMN complex
designated Gemin3 for component of gems number 3 (see below). Because of the
existence of several unrelated proteins called SIP1 (Mylin et al., 1994, Genetics
137:689-700; Zhang et al., 1998, Mol. Cell. Biol. 18:676-684; Verschueren et al., 1999,
J. Biol. Chem. 274:2089-2098), the SMN-interacting protein SIP1 has been tentatively
renamed Gemin2, for component of gems number 2 (SMN is the first component of
gems identified, Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565). Gemin3 has high
amino acid sequence similarity with the RNA-helicase core region of the human
eukaryotic initiation factor 4A-II (eIF4A-II). eIF4A-II is a DEAD-box RNA helicase
that belongs to the SFII superfamily of helicases (reviewed in De la Cruz et al., 1999,
TIBS 24:192-198). A scheme depicting the modular structure of Gemin3 and the
predicted amino acid sequence of Gemin3 aligned with the sequence of eIF4A-II is
presented in Figure 26. This alignment disclosed the presence of seven motifs in the
Gemin3 protein, motifs that are characteristic of the RNA helicase core region.
Database searches with the C-terminal nonconserved region did not reveal significant
homology to any other protein or to any recognizable motifs.
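
A calculated molecular mass of the kind quoted above is conventionally obtained
by summing average residue masses over the conceptual translation and adding
one water molecule. The following is a minimal sketch under that assumption;
the residue masses are rounded textbook averages, the function names are
illustrative, and the Gemin3 sequence itself is not reproduced here.

```python
# Rounded average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.05, "A": 71.08, "S": 87.08, "P": 97.12, "V": 99.13,
    "T": 101.10, "C": 103.14, "L": 113.16, "I": 113.16, "N": 114.10,
    "D": 115.09, "Q": 128.13, "K": 128.17, "E": 129.12, "M": 131.19,
    "H": 137.14, "F": 147.18, "R": 156.19, "Y": 163.18, "W": 186.21,
}
WATER = 18.02  # one water is added back for the free N- and C-termini


def average_mass_da(sequence):
    """Approximate average mass of an unmodified polypeptide, in daltons."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unknown residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def mean_residue_mass_da(total_mass_kda, n_residues):
    """Mean mass per residue implied by a total mass and a chain length."""
    return total_mass_kda * 1000.0 / n_residues


if __name__ == "__main__":
    # Figures quoted in the text: 824 amino acids, 92.2 kDa.
    print(f"{mean_residue_mass_da(92.2, 824):.1f} Da per residue")
```

The mean residue mass implied by the quoted figures is close to the value of
roughly 110 Da per residue often used as a rule of thumb for proteins.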

Production of monoclonal antibodies to Gemin3 

To investigate the interaction of Gemin3 with SMN and to characterize
Gemin3 further, monoclonal antibodies that specifically bind Gemin3 were produced
by immunizing mice with a purified, bacterially produced recombinant 6His-tagged
Gemin3 fragment (amino acids 368 to 548). Two hybridomas, 11G9 and 12H12, were
selected for additional studies. The data disclosed herein demonstrate that these
hybridomas produce monoclonal antibodies that recognize Gemin3 specifically. First,
both 11G9 and 12H12 immunoprecipitate Gemin3 produced by in vitro transcription
and translation from the Gemin3 cDNA, but do not immunoprecipitate similarly
produced hnRNP A1 or SMN proteins (Figure 28A). Second, the monoclonal antibody
11G9 efficiently recognized purified 6His-Gemin3 on Western blots but did not
recognize similarly produced and purified 6His-tagged Gemin2 (Figure
28B). Finally, on an immunoblot of total HeLa lysate, both 11G9 and 12H12 recognize
a single protein of approximately 105 kDa (Figure 28C). Monoclonal antibodies 11G9
and 12H12 did not recognize a specific protein on a Western blot with total mouse 3T3
cell lysate or Xenopus laevis XL-177 cell lysate. However, 11G9 specifically
immunoprecipitated a single protein of about 105 kDa from these cell lysates,
suggesting that Gemin3, like SMN, is conserved in vertebrates.

Gemin3 and SMN co-localize in gems

Indirect laser confocal immunofluorescence microscopy using
antibodies 11G9 and 12H12 was performed on HeLa cells to determine the subcellular
localization of Gemin3. Figure 29A depicts that Gemin3 is found throughout the
cytoplasm and also displays intense staining of prominent discrete nuclear bodies that
are also readily discernible by differential interference contrast (DIC) microscopy
(Figure 29B). This pattern is similar to that seen for SMN and Gemin2 (Liu and
Dreyfuss, 1996, EMBO J. 15:3555-3565; Example 1, supra), except that the
nucleoplasmic staining of Gemin3 was stronger than the patterns detected for SMN and
Gemin2 (SIP1).

In order to assess whether the nuclear structures stained by 11G9 were
gems or coiled bodies, double-label immunofluorescence experiments were performed
using antibodies against Gemin3 and against either p80-coilin as a marker of coiled
bodies (Andrade et al., 1991, J. Exp. Med. 173:1407-1419) or SMN as a marker of gems
(Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565; Figure 29A). In many cell lines
gems and coiled bodies entirely overlap by antibody staining; however, in the HeLa PV
strain used herein, these two bodies are frequently found separate from each other (Liu
and Dreyfuss, 1996, EMBO J. 15:3555-3565; Matera and Frey, 1998, Am. J. Hum.
Genet. 63:317-321). Therefore, HeLa PV cells were used to examine whether Gemin3
is located in gems or in coiled bodies. As depicted in Figure 29C-29D, the nuclear
structures that contain Gemin3 were clearly distinct from coiled bodies but Gemin3
completely co-localized with SMN in gems (Figure 29E). The co-localization of
Gemin3 with SMN strongly supports the conclusion that these two proteins exist as a
complex in the cell. Gemin3 is, thus, the third constituent of gems described so far.

Gemin3 is in a complex with SMN, Gemin2 and the spliceosomal Sm proteins

To characterize further the Gemin3 complex, immunoprecipitations
using anti-Gemin3 monoclonal antibodies and [35S]methionine-labeled HeLa cells were
carried out in the presence of either Triton X-100 or the more stringent detergent
Empigen BB (Matunis et al., 1994, Methods Cell Biol. 44:191-205). The
immunoprecipitated proteins were then analyzed by SDS-PAGE. As references for
these immunoprecipitations, an immunoprecipitation with the anti-Sm monoclonal
antibody Y12 (Lerner and Steitz, 1979, Proc. Natl. Acad. Sci. USA 76:5495-5499;
Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741) and an
immunoprecipitation with the anti-SMN monoclonal antibody 2B1 were also included
for purposes of comparison. As shown in Figure 30A, several proteins can be
co-immunoprecipitated with Gemin3 and the pattern of immunoprecipitated proteins is
very similar to that obtained with the anti-SMN antibody. In addition to Gemin3, SMN
and Gemin2, there are several prominent bands at 175 kDa, 95 kDa and 50 kDa. The
two groups of proteins at 28 kDa and 15 kDa have been identified previously as the Sm
B/B', D1-3, E, F and G proteins of snRNPs (e.g., Example 1). In addition, there were
protein bands which co-immunoprecipitated only with anti-SMN (at 60 kDa) or only
with anti-Gemin3 (at 115 kDa) monoclonal antibodies. As further evidence for the
specificity of the antibodies used, the immunoprecipitations were performed in the
presence of Empigen BB. Under these conditions, anti-Gemin3 and anti-SMN
antibodies immunoprecipitate Gemin3 and SMN proteins, respectively (+ Empigen BB,
lane 11G9 and lane 2B1, respectively). Even in the presence of Empigen BB, a protein
of 95 kDa was still present in both of these
immunoprecipitations, but not in the control SP2/0 immunoprecipitation, suggesting
that this unidentified protein interacts tightly with both Gemin3 and SMN.

To confirm the co-immunopurification results, the interaction of
Gemin3 with SMN, Gemin2 and the Sm proteins in HeLa cells was assessed in vivo
using immunoprecipitations and Western blot experiments. The anti-Gemin3
monoclonal antibody 11G9 was used for immunoprecipitation from total HeLa cell
extracts, and the immunoprecipitates were then resolved by SDS-PAGE, transferred to
nitrocellulose, and the immunoblot was probed with the anti-SMN antibody (Liu and
Dreyfuss, 1996, EMBO J. 15:3555-3565). As depicted in Figure 30C (lane 11G9 IP),
2B1 readily detects SMN in the 11G9 immunoprecipitates, indicating that SMN is
co-immunoprecipitated with Gemin3. Because SMN is associated with Gemin2 to form a
stable complex in vivo and in vitro (see, e.g., Example 1), whether Gemin3 could be
co-immunoprecipitated with Gemin2 was determined. As disclosed in Figure 30C, the
anti-Gemin2 monoclonal antibody 2S7 clearly detects Gemin2 in the anti-Gemin3
11G9 immunoprecipitates (lane 11G9 IP). In a reciprocal experiment, the Gemin3
protein could also be co-immunoprecipitated by the anti-SMN monoclonal antibody
2B1 (Figure 30D, lane 2B1 IP) and the anti-Gemin2 monoclonal antibody 2S7 (Figure
30D, lane 2S7 IP). Because SMN and Gemin2 are found in a complex with the Sm
proteins, it was determined whether Gemin3 can be co-immunoprecipitated with the
spliceosomal snRNP Sm core proteins as well. Figure 30D demonstrates that Gemin3
is present in the anti-Sm monoclonal antibody Y12 immunoprecipitates (lane IP Y12)
like SMN and Gemin2 (Liu et al., 1997, Cell 90:1013-1021). No Gemin3, SMN,
Gemin2 or Sm proteins were detected in a SP2/0 immunoprecipitate. These results
demonstrate that Gemin3, SMN, Gemin2, and the Sm proteins are associated in vivo in
a complex that can be immunoprecipitated by anti-SMN, anti-Gemin2, anti-Sm
or anti-Gemin3 antibodies.

Further support for the existence in vivo of a complex that contains
SMN, Gemin2 and Gemin3 was obtained from gel filtration experiments. HeLa
cytoplasmic S100 extract was fractionated on a Superose 6 HR 10/30 high performance
gel filtration column and each fraction was subjected to SDS-PAGE followed by
Western blot with anti-Gemin3, anti-SMN and anti-Gemin2 monoclonal antibodies.
Gemin3, SMN and Gemin2 co-migrated and showed a peak at about 800 kDa,
demonstrating that they are components of a large macromolecular complex (Figure
30E). A second pool of SMN-Gemin2, lacking Gemin3, was observed in a lower
molecular weight complex which peaked at about 150 kDa, suggesting that at least two
different SMN-Gemin2 sub-complexes exist in vivo. However, without wishing to be
bound by any particular theory, the possibility cannot be excluded that the 150 kDa
sub-complex corresponded to a fraction of SMN-Gemin2 that dissociated from Gemin3
during cell fractionation and/or chromatography. The data previously disclosed
elsewhere herein demonstrate that a SMN-Gemin2 complex migrates at about 300 kDa,
or higher, after filtration of a cytoplasmic S100 extract on a TSK-GEL G3000-SW
column (Example 1). The Superose 6 HR 10/30 gel filtration column used herein
permitted a better resolution of the cytoplasmic SMN complex and permitted a more
accurate estimate of its size, which is about 800 kDa.
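
The text does not state how the size estimate was derived. Size estimates from
gel filtration are commonly obtained from a calibration line of
log10(molecular mass) against elution volume, fitted to standards of known
mass. The following is a minimal sketch of that general approach; the function
names are illustrative, and the elution volumes assigned to the standards are
invented for the example and are not data from these experiments.

```python
import math


def fit_calibration(standards):
    """Least-squares line of log10(mass in kDa) versus elution volume (ml).

    standards: list of (elution_volume_ml, mass_kda) pairs.
    Returns (slope, intercept).
    """
    if len(standards) < 2:
        raise ValueError("at least two standards are required")
    xs = [volume for volume, _ in standards]
    ys = [math.log10(mass) for _, mass in standards]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("standards must differ in elution volume")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_mass_kda(elution_volume_ml, slope, intercept):
    """Read an apparent mass (kDa) off the calibration line."""
    return 10 ** (slope * elution_volume_ml + intercept)


if __name__ == "__main__":
    # Hypothetical calibration: masses of widely used protein standards
    # paired with invented elution volumes.
    hypothetical_standards = [(12.5, 669.0), (14.5, 440.0), (16.5, 158.0), (18.0, 44.0)]
    slope, intercept = fit_calibration(hypothetical_standards)
    print(f"apparent mass at 12.0 ml: {estimate_mass_kda(12.0, slope, intercept):.0f} kDa")
```

An apparent mass read from such a line depends on molecular shape as well as
mass, which is one reason the same complex can give different estimates on
different columns.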

Gemin3 interacts directly with SMN and the spliceosomal Sm proteins in vitro

To further analyze the Gemin3 complex, in vitro protein binding assays
between Gemin3 and several components of the SMN complex were performed. For
in vitro binding assays, Gemin3 was produced as a fusion protein with glutathione S-
transferase (GST), and SMN and Gemin2 were produced and labeled with
[35S]methionine by in vitro transcription and translation in rabbit reticulocyte lysate.
Purified GST or GST-Gemin3 fusion protein immobilized on glutathione-Sepharose was
incubated with labeled SMN or Gemin2 proteins. Following extensive washing, bound
proteins were eluted by boiling in SDS-containing sample buffer and the eluted
material was analyzed by SDS-PAGE and detected by fluorography. Full-length SMN,
but not Gemin2, bound specifically to immobilized GST-Gemin3 (Figure 31A) but not
to GST alone. To investigate whether Gemin3 interacts with Sm proteins, purified
GST or GST-Gemin3 recombinant proteins were used for binding assays with in vitro
[35S]methionine-labeled Sm proteins B, D1, D2, D3, E, F and G (Lehmeier et al., 1994,
Proc. Natl. Acad. Sci. USA 91:12317-12321; Herrmann et al., 1995, EMBO J.
14:2076-2088; Raker et al., 1996, EMBO J. 15:2256-2269). The results, depicted in
Figure 31B, demonstrate that the Sm proteins B and D3 bind to GST-Gemin3 whereas
there was no detectable binding to GST alone. D2 binds Gemin3 only weakly and the
profiles of Sm protein binding to SMN and Gemin3 are not identical (see Example 1).
For example, SMN binds to D1 while Gemin3 does not.




To address the possibility that some component of the rabbit 
reticulocyte lysate mediates these interactions, wild-type full-length SMN and SmB 
were produced as recombinant 6His-tagged proteins and were incubated with GST or 
GST-Gemin3. After several rounds of washing, bound proteins were solubilized by 
boiling in SDS-sample buffer, resolved by SDS-PAGE, immunoblotted and probed 
with a rabbit polyclonal antibody specific to the 6His-tag. As depicted in Figure 31C, 
SMN and SmB bind specifically to Gemin3 but not to GST alone. Thus, both SMN
and SmB interact directly with Gemin3.

In order to further characterize the interaction between Gemin3 and
SMN, it was determined whether SMN carrying either of two well-characterized
mutations found in SMA patients, the Y272C point mutant (SMNY272C) and the exon 7
deletion mutant (SMNΔEx7), the major product of the SMN2 gene (reviewed in Burghes,
1997, Am. J. Hum. Genet. 61:9-15; Talbot et al., 1997, Hum. Mol. Genet. 6:497-500),
was able to interact with Gemin3. SMN wild type and mutants were produced and
labeled with [35S]methionine using in vitro transcription and translation in rabbit
reticulocyte lysate. Full-length wild-type SMN bound specifically to immobilized
GST-Gemin3 (Figure 31D). However, SMNY272C and SMNΔEx7 are severely
defective in their ability to bind GST-Gemin3. No detectable binding was observed to 
GST alone. Similar results were observed using purified recombinant 6His-SMN wild 
type and mutant proteins instead of in vitro translated products. 

SMN oligomerization and Sm binding are not mutually exclusive, and
in fact, Sm binding is strongly enhanced by SMN oligomerization (e.g., Figure 31E).
To determine whether SMN self-association enhances Gemin3 interaction, GST-SMN,
or GST as a control, was pre-incubated with a molar excess of recombinant 6His-tag
SMN to form SMN oligomers. After removing the unbound 6His-tag SMN by
washing, in vitro translated [35S]methionine-labeled Gemin3 and SmB were added and
assayed for binding (Figure 31E). SmB binding was strongly enhanced by SMN
oligomerization; however, Gemin3 binding was not affected.

Gemin3 interacts with SMN via its unique C-terminal domain 






The unwinding activity of DEAD box RNA helicases may not be
sequence specific. The target specificity of these proteins is at least in some cases
provided by their interaction with specific proteins of the RNP substrate. These
interactions appear to be mediated via the unique auxiliary domain that each RNA
helicase contains (Staley and Guthrie, 1998, Cell 92:315-326; Hamm and Lamond,
1998, Curr. Biol. 8:532-534). Therefore, the role of the unique C-terminal domain of
Gemin3 (amino acids 430-825) in the interaction with SMN was determined. To do so,
three deletion mutants of Gemin3 were constructed and tested for their ability to bind
GST-SMN. Wild-type and mutant myc-Gemin3 constructs were transcribed and
translated in rabbit reticulocyte lysate in the presence of [35S]methionine, and the
resultant translated products were assayed for binding to GST-SMN as described
previously elsewhere herein. As Figure 32B indicates, the wild-type myc-Gemin3
protein and the myc-ΔN368C277Gemin3 mutant protein interact specifically with GST-
SMN but not with GST alone. The myc-ΔC328Gemin3 and myc-ΔN548Gemin3
mutant proteins clearly do not interact with GST-SMN. Thus, the C-terminal domain
of Gemin3 (amino acids 456 to 547) mediates the interaction of SMN with Gemin3.

The molecular characterization of the spinal muscular atrophy gene
product, SMN, demonstrated that it is concentrated in novel nuclear structures called
gems (Liu and Dreyfuss, 1996, EMBO J. 15:3555-3565; Example 1). Coiled bodies
and gems represent nuclear structures that appear to be involved in RNA metabolism
and in many of the cell lines studied these two bodies are often found in association
(Lamond and Carmo-Fonseca, 1993, Trends Cell Biol. 3:198-204; Gall et al., 1995,
Dev. Genet. 16:25-35; Liu et al., 1997, Cell 90:1013-1021; Liu and Dreyfuss, 1996,
EMBO J. 15:3555-3565; Example 4; Matera and Frey, 1998, Am. J. Hum. Genet.
63:317-321). SMN is also found in the cytoplasm where, together with its tightly
associated partner, Gemin2, it functions in the assembly of snRNP particles (Example
2; Example 4). In the nucleus, SMN is required for pre-mRNA splicing, and likely
serves to assemble and maintain the splicing machinery in an active form as discussed
previously elsewhere herein. To perform these functions, SMN must either have an
intrinsic activity or it must recruit to the complex other proteins that can actively affect
structural transitions in certain RNP targets. Several factors that have the capacity to
serve in such functions, including assembly and disassembly of components of the
splicing machinery, have been described. Many of these factors are DEAD/DEAH box
RNA helicases that are essential for splicing (reviewed in Staley and Guthrie, 1998).
Prp43, for instance, is required for the disassembly of the snRNP-intron lariat complex
(Arenas and Abelson, 1997, Proc. Natl. Acad. Sci. USA 94:11798-11802), Prp22 is
needed to release the mature mRNA from the spliceosome (Company et al., 1991,
Nature 349:487-493), and Prp24 acts as a recycling factor for U4 and U6 snRNP
(Raghunathan et al., 1998, Science 279:857-860).

Using a biochemical approach to characterize new components of the 
SMN complex, a novel DEAD box RNA helicase termed Gemin3 has been identified. 
Gemin3 forms a stable complex with SMN in vivo and in vitro and it co-localizes with 
SMN in nuclear gems. Several lines of evidence suggest that Gemin3 and SMN 
function as a complex in vivo. SMN and Gemin3 can be co-immunoprecipitated and
both are present in a large (approximately 800 kDa) complex that also contains
Gemin2. Anti-SMN, anti-Gemin2 or anti-Gemin3 monoclonal antibodies
immunoprecipitate the spliceosomal snRNP core Sm proteins, as well as several other
unidentified proteins. Gemin3 interacts directly with SMN and with several snRNP
Sm core proteins, including B/B', D2 and D3. In addition, Gemin3 is uniformly
distributed in the cytoplasm, where snRNP assembly takes place, and it can be
specifically co-immunoprecipitated with the cytoplasmic pool of Sm proteins.
Together, these findings suggest that Gemin3 plays an important role in spliceosomal 
snRNP biogenesis. 

DEAD box proteins have been found to be involved in many aspects of
RNA metabolism, including pre-mRNA splicing, translation, snRNP-snRNP
interactions, mRNA degradation, and mRNA transport in eukaryotes and prokaryotes
(Arenas and Abelson, 1997, Proc. Natl. Acad. Sci. USA 94:11798-11802; Company et
al., 1991, Nature 349:487-493; Ohno and Shimura, 1996, Genes & Dev. 10:997-1007;
Staley and Guthrie, 1998, Cell 92:315-326; Hamm and Lamond, 1998, Curr. Biol.
8:532-534; De la Cruz et al., 1999, TIBS 24:192-198). One of the major questions
about the function of each DEAD/DEAH box RNA helicase is the identification of its
specific RNA target. Some of the enzymes of this family can unwind generic
RNA substrates in vitro. For these enzymes, the specificity towards particular RNAs
therefore appears to be determined by factors that interact with their unique auxiliary
domains. For example, the DEAH-box RNA helicase Prp16 is recruited to the
spliceosome via its unique N-terminal domain. The specific substrate for Gemin3 has
not yet been identified and this remains a central question of interest. Although RNA
helicase or RNA-dependent ATPase activity has not been detected for recombinant
Gemin3, it is possible that such activity will only manifest itself when Gemin3 is
associated with other proteins as part of a complex, or that it will be detectable once a
specific RNA or RNP target is found. The interaction of Gemin3 with SMN is direct;
amino acids 456 to 547 of Gemin3 mediate this interaction and, likely as a
consequence of this, also mediate the localization of Gemin3 to the gems. Thus,
without wishing to be bound by theory, Gemin3 may provide the enzymatic activity of the
SMN complex to affect structural transitions in its RNA targets.

The SMN protein is capable of forming an oligomer of more than 400
kDa in vitro and the data disclosed herein demonstrate that SMN co-migrates with an
approximately 800 kDa complex that also contains Gemin2 and Gemin3. It is likely
that SMN oligomerization is critical for the nucleation of this large complex. In
addition to Gemin3 and Gemin2, several Sm proteins interact with SMN, and it may be
that SMN forms a docking platform to bring together in the appropriate spatial
arrangement the multiple proteins that are involved in the de novo assembly and
regeneration of its RNP (e.g., snRNP) substrates. Interestingly, the interaction of SMN
with Gemin3 is severely reduced by mutations found in SMA patients, such as the
point mutant SMNY272C or the exon 7 deletion. Thus, the formation of the SMN
platform seems critical for SMN function because SMA mutations affect both the
capacity of SMN to oligomerize and its capacity to interact with several Sm proteins
and Gemin3. Likely as a consequence of these defective interactions, the function of
SMN in the regeneration of the splicing machinery is abolished.






Coiled bodies contain the highest local concentration of p80 coilin and 
are enriched in components of three major RNA processing pathways: pre-mRNA 
splicing, histone mRNA 3' maturation and pre-mRNA processing. Gems contain the 
highest local concentration of SMN, Gemin2 and Gemin3 and are often found
associated with coiled bodies (see, e.g., Liu and Dreyfuss, 1996, EMBO J. 15:3555-
3565; Example 1). Although the definitive function of these two nuclear bodies has
not been completely elucidated, the characterization of their protein and RNA contents
represents an important step toward the understanding of their functions. Further
studies of Gemin3, a novel DEAD box containing protein and component of gems, will
shed light on the functions of the SMN complex and gems.

Example 6: Production of cell line comprising reduced level SMN for study of SMA

To facilitate the study of the function of SMN in motor neurons and to
develop a system that may be useful for testing potential therapeutic approaches, a cell
line can be generated that expresses drastically reduced levels of SMN. The cell can be
produced by transfecting a neuronal cell line which normally expresses SMN with an 
SMN-modulating sequence, e.g., antisense nucleic acid complementary to a nucleic 
acid encoding SMN or an SMN-specific ribozyme. A stable cell line can be obtained 
having significantly reduced SMN protein levels as demonstrated by decreased SMN 
mRNA expression. 

Generation of cell lines 

Motor neuron cell lines have been generated (Salazar Grueso et al.,
Neuroreport 2:505-508) from embryonic murine spinal cord cells enriched for motor
neurons by differential centrifugation, based on the approach originally described by
Hammond et al. (Science 234:1237-1240). Human SMN cDNA clone BCD541 is
available in the art. SMN cDNA constructs, in either sense or antisense orientation,
may be subcloned into a mammalian expression vector, for example, pZeoSV2
(Invitrogen, Carlsbad, CA), using standard methods (see, e.g., Sambrook et al., supra,
and Ausubel et al., supra). After transfection of the motor neuron cell line with sense,
antisense, or vector-only constructs, stable integrants may be isolated and the resultant
transgenic cell lines can be referred to as "sense", "antisense" or "vector-only" cells,
respectively. The cell lines are cultured as described in Salazar Grueso et al., supra.

DNA Isolation and PCR Analysis

The cells are grown in culture and DNA is isolated therefrom using 
standard methods. PCR assays may be performed using primers and probes specific 
for human SMN cDNA as described in, for example, Sambrook et al., supra, and
Ausubel et al., supra. 

RNA Isolation and RT-PCR 

Total RNA may be extracted from cells using the RNeasy kit (Qiagen,
Chatsworth, CA) according to the manufacturer's instructions. Total RNA may be 
reverse-transcribed using, for example, the Superscript II reverse transcriptase 
(GIBCO/BRL) and random hexamers per the manufacturer's instructions. PCR may be 
performed as described elsewhere herein using human SMN specific primers. The 
PCR products may be separated on an agarose gel and may then be visualized by 
staining with ethidium bromide. 

Western blot analysis

Total cell lysates in the presence of 0.5% Triton X-100 are prepared as
previously described by Coovert et al. (1997, Hum. Mol. Gen. 6:1205-1214). Equal
amounts of proteins are loaded on each lane of a SDS-PAGE gel and the Western blot
is performed as previously described herein. The following monoclonal antibodies can
be used: anti-SMN (2B1), anti-SIP1 (2E17), anti-hnRNP A1 (4B10) (Piñol-Roma et
al., 1988, Genes & Dev. 2:215-227), and anti-Sm (Y12) (Lerner et al., 1981, Proc.
Natl. Acad. Sci. USA 78:2737-2741).

Immunofluorescence analysis

Immunofluorescence staining is carried out essentially as previously
described (Choi et al., 1984, J. Cell. Biol. 99:1997-2004). Double-label
immunofluorescence experiments are performed by separate sequential incubations of
each primary antibody, at an appropriate dilution, followed by incubation with the
specific secondary antibody coupled to fluorescein isothiocyanate (FITC) or Texas Red
(TXRD). Laser confocal fluorescence microscopy is performed and the images from
each channel are recorded separately, and then the data files are merged. The
following antibodies may be used: rabbit polyserum anti-p80 coilin (R288) (Andrade et
al., 1991, J. Exp. Med. 173:1407-1419), mouse IgG1 monoclonal anti-SMN (2B1), and
mouse IgG1 monoclonal anti-SIP1 (2E17).

Cell Growth study

Cells from each of the three cell lines, sense, antisense and vector-only, 
may be synchronized by serum-deprivation and the cells may be plated in triplicate for
growth analyses. 

Transfection of a neuronal cell line with SMN cDNA constructs

Human SMN cDNA constructs, in either sense or antisense orientation,
are subcloned into a mammalian expression vector, pZeoSV2, to allow constitutive
expression of the insert driven by the SV40 early enhancer/promoter. A spinal cord 
neuronal cell line exhibiting the characteristics of primary motor neurons, including 
high levels of choline acetyltransferase (ChAT), extensive branching neurite formation, 
and glial-derived neurotrophic factor (GDNF) receptors described by Salazar Grueso et 
al. (Neuroreport 2:505-508; Trupp et al., Nature 381:785-788), may be used for 
transfections. 

Motor neuron cells are transfected with sense, antisense, or vector-only 
constructs, and the cells are grown in selective medium (e.g., medium containing 
zeomycin in the case of pZeoSV2 constructs). Stable transfectants are isolated, and 
these are referred to as sense, antisense, or vector-only cells, respectively. The 
introduction of a human cDNA encoding SMN in a cell line permits the confirmation 
of the presence or absence of the human SMN sequence, as well as facilitating the 
monitoring of human SMN expression. 

Detection of the presence and expression of human SMN in transfectants 

The presence of the human SMN transgene DNA in both the sense and 
antisense cells, but not in vector-only cells, may be confirmed by PCR of DNA 
obtained from stable transfectants using human-specific primers. Further, expression 
of the human SMN sense or antisense cDNA constructs may be demonstrated by 
RT-PCR of total RNA using human-specific primers. Using these primers, SMN RNA 
may also be detected in positive control human tissue. 

Detection of altered SMN protein level in transfectants 
Immunoblotting using the anti-SMN monoclonal antibody, 2B1, should 
identify whether the cells exhibit reduced levels of expression of SMN. 

Example 7: Cell line lacking endogenous SMN expression 
To facilitate the study of the function of SMN in cells and to develop a 
system for studying the role(s) of SMN in cell processes as well as for testing potential 
therapeutic approaches for SMA, a cell line was generated that lacked endogenous 
expression of SMN but was stably transfected with a plasmid vector comprising a 
nucleic acid encoding SMN under the control of an inducible promoter. More 
specifically, a chicken pre-B lymphoid cell line, i.e., DT40, which exhibited a high 
degree (i.e., approximately 1,000-fold higher than normal) of homologous 
recombination, was used to generate a cell line lacking a nucleic acid encoding chicken 
SMN. In this cell line, the endogenous cSMN expression was repressed by 
expression of non-functional HA-cSMN expressed under the control of an inducible 
promoter (i.e., the tetracycline repressible promoter) (see, e.g., Figures 33A, 33B, 34A, 
and 34B). The system allows the rapid growth of the recombinant cells as long as 
tetracycline is present in the culture medium, thereby repressing expression of HA-cSMN. 
However, removal of tetracycline from the medium caused derepression of the 
promoter, expression of HA-cSMN, and inhibition of endogenous wild type cSMN 
expression in the cell. Thus, the cell line provides a stable genetic system that can be 
easily manipulated to emulate the lack of SMN exhibited by the cells of SMA patients, 
thereby providing a useful system for study of the mechanisms associated with SMA 
and for identifying compounds useful for the treatment of SMA. 

Decreased or absent levels of SMN in the cell of the invention impaired 
cell growth and eventually caused cell death, which was presumably mediated by 
impairment of cellular processes in a manner analogous to the disease process 
demonstrated by SMA patients. Therefore, the recombinant cell line containing 
decreased levels of SMN protein is a useful model for SMA, which is characterized by 
reduced, or absent, amounts of SMN. Further, the cell line expressing altered, e.g., 
decreased or absent levels of SMN, provides an important tool for identifying 
compounds useful for treatment of SMA. In addition, the cell line can be transfected 
with mutant, variant, and/or homologous forms of SMN such that the effects of change 
in SMN can be identified and characterized, providing a system for the study of 
naturally occurring and/or genetically engineered selected forms of SMN. 

Example 8: SMN knock-out cell 

To facilitate the study of the function of SMN in cells and to develop a 
system for studying the role(s) of SMN in cell processes as well as for testing potential 
therapeutic approaches for SMA, a cell line is generated that lacks an endogenous 
nucleic acid encoding SMN but which comprises an exogenous nucleic acid encoding 
SMN under the control of an inducible promoter. More specifically, a chicken pre-B 
lymphoid cell line, i.e., DT40, which exhibits a high degree (i.e., approximately 
1,000-fold higher than normal) of homologous recombination, is used to generate a cell 
line lacking a nucleic acid encoding chicken SMN. Although other cell lines can be used 
to generate an SMN "knock-out" cell line, DT40 provides the advantage that SMN is 
present in the cell as a single allele. Thus, unlike the mammalian genome which 
comprises centromeric and telomeric alleles of SMN, as discussed previously 
elsewhere herein, the DT40 cell line comprises a single genetic locus comprising a 
nucleic acid encoding SMN. Therefore, although other cell lines can be used, DT40 
provides an advantageous system for the creation of SMN knock-out cell lines. 

This cell line, wherein the endogenous sequence encoding SMN is 
"knocked-out", is then stably transfected using a plasmid vector comprising an isolated 
nucleic acid encoding chicken SMN under the control of an inducible promoter. This 
system allows the rapid growth of the recombinant cells as long as the promoter is 
induced and SMN is expressed in the cells. However, repression of the promoter 
inhibits production of SMN and the cells eventually cease growth in culture and/or 
demonstrate altered growth characteristics as discussed elsewhere herein. Thus, the 
cell line provides a stable genetic system that can be easily manipulated to emulate the 
lack of SMN exhibited by the cells of SMA patients, thereby providing a useful system 
for study of the mechanisms associated with SMA and for identifying compounds 
useful for the treatment of SMA. 

Decreased or absent levels of SMN in the cell of the invention impair 
cell growth, which is mediated by impairment of cellular processes in a manner 
analogous to the disease process demonstrated by SMA patients. Therefore, the 
recombinant cell line containing decreased levels of SMN protein is a useful model for 
SMA which is characterized by reduced, or absent, amounts of SMN. Further, the cell 
line expressing altered, e.g., decreased or absent levels of SMN, provides an important 
tool for identifying compounds useful for treatment of SMA. 

In addition, the cell line can be transfected with mutant, variant, and/or 
homologous forms of SMN such that the effects of change in SMN can be identified 
and characterized providing a system for the study of naturally occurring and/or 
genetically engineered selected forms of SMN. 

Example [3] 9: Specific sequences in SMN and SIP1 which mediate 
their interactions with each other and with Sm proteins and which are associated with 
defective interactions in SMA 

The experiments presented in this example may be summarized as follows. 

Deletion mutants of SMN and SIP1 were prepared and used to identify 
the domains involved in the interactions between SMN and SIP1 as well as their 
interactions with their associated proteins such as the Sm proteins. Further, the effects 
of specific deletions and point mutations on the composition of the SMN complex and 
on the SMN and SIP1 cellular localization, with particular interest in targeting to gems, 
were analyzed. The data disclosed herein demonstrate that several SMN interactions 
are affected by mutations that occur in some SMA patients and that SMN 
oligomerization greatly enhances its interaction with Sm proteins. These results 
suggest a model of the SMN complex in which a SMN/SIP1 tetramer (or a higher 
oligomer) is the functional core required for efficient binding to Sm proteins, and thus 
snRNP assembly. These findings further strengthen the view that SMA is the result of 
a defect in snRNP metabolism. 






The Materials and Methods used in the experiments presented in this 
example are now described. 

Plasmid Construction 

DNA fragments corresponding to the open reading frames of SMN and 
SIP1 wild-type (wt) and mutant proteins were generated by polymerase chain reaction 
(PCR) amplification using suitable primers. All of the myc-tagged constructs were 
generated by cloning the PCR inserts into a modified pcDNA3 vector (Invitrogen, 
Carlsbad, CA) downstream from the myc epitope recognized by the monoclonal 
antibody 9E10 (Siomi and Dreyfuss, 1995, J. Cell. Biol. 129:551-560). Plasmid 
constructs containing SMN fused to the carboxyl terminus of myc-pyruvate 
kinase (PK) were obtained by cloning the SMN coding PCR fragments into a myc-PK 
vector derived from pcDNA3 as previously described (Nakielny and Dreyfuss, 1996, J. 
Cell. Biol. 134:1365-1373). Maltose binding protein (MBP)-SMN fusions were 
obtained by cloning the SMN coding PCR fragments into a modified pcDNA1 vector 
(Invitrogen, Carlsbad, CA) downstream from the MBP sequence. These vectors 
facilitated the expression driven by the CMV promoter in vivo and by the T7 promoter 
in vitro. 

Production of proteins in vitro 

The [35S]methionine-labeled proteins were produced by an in vitro 
coupled transcription-translation reaction (Promega Corp., Madison, WI) in the 
presence of [35S]methionine (Amersham, Arlington Heights, IL). His6-SMN fusion 
protein was expressed from a pET28 bacterial expression system in the E. coli strain 
BL21(DE3)pLysS and the fusion protein was purified by Ni2+ chelation chromatography 
with the Novagen (Madison, WI) His-bind Buffer Kit following the manufacturer's 
protocol. All the GST fusion proteins were expressed from the GST expression vector 
pGEX-5X (Pharmacia) in the E. coli strain BL21(DE3)pLysS and were purified using 
glutathione-Sepharose according to the manufacturer's protocol (Pharmacia Biotech). 
SmB cDNA is described in Raker et al. (1996, EMBO J. 15:2256-2269). SMN and 
SIP1 cDNAs were obtained using a yeast di-hybrid screening assay previously 
described herein. 






In vitro protein-binding assay 

Purified GST and GST fusion proteins (2 µg) were incubated with 10^ 
cpm of the in vitro translated protein product and 25 µl of glutathione-Sepharose beads 
in 500 µl of binding buffer (50 mM Tris-HCl [pH 7.5], 200 mM NaCl, 2 mM EDTA, 
0.1% NP40, 2 µg/ml leupeptin and pepstatin A, and 0.5% aprotinin). After incubation 
for 1 hour at 4°C, the resin was pelleted, washed five times with 1 ml of binding buffer, 
and the bound fraction was eluted. The eluted proteins were analyzed by SDS-PAGE, 
and the radiolabel signal was enhanced by treatment with Amplify solution (Amersham, 
Arlington Heights, IL). 
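
The binding-buffer recipe above can be turned into pipetting volumes with the 
usual dilution relation C1 x V1 = C2 x V2. The following is only an illustrative 
sketch: the final concentrations are those stated above, whereas the stock 
concentrations are hypothetical values chosen for the example and are not taken 
from this disclosure. The protease inhibitors (leupeptin, pepstatin A and 
aprotinin) are omitted for brevity. 

```python
# Final concentrations taken from the binding-buffer recipe in the text.
FINAL = {
    "Tris-HCl pH 7.5 (mM)": 50.0,
    "NaCl (mM)": 200.0,
    "EDTA (mM)": 2.0,
    "NP40 (%)": 0.1,
}

# ASSUMPTION: hypothetical stock concentrations, not stated in the text.
STOCKS = {
    "Tris-HCl pH 7.5 (mM)": 1000.0,
    "NaCl (mM)": 5000.0,
    "EDTA (mM)": 500.0,
    "NP40 (%)": 10.0,
}


def stock_volumes_ul(final, stocks, total_ul):
    """Return the volume (in microliters) of each stock needed for total_ul of buffer.

    Uses C1 * V1 = C2 * V2, i.e. V1 = C2 * V2 / C1, and fills the remainder with water.
    """
    volumes = {name: final[name] * total_ul / stocks[name] for name in final}
    volumes["water"] = total_ul - sum(volumes.values())
    return volumes


# One 500 microliter binding reaction, as in the assay described above.
volumes = stock_volumes_ul(FINAL, STOCKS, 500.0)
for component, microliters in volumes.items():
    print(f"{component}: {microliters:.1f} ul")
```

With the assumed stocks, a 500 µl reaction takes 25 µl Tris-HCl, 20 µl NaCl, 
2 µl EDTA and 5 µl NP40 stock, with water making up the rest; different stock 
concentrations simply change the dictionary values. 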

Cell culture and treatments 

HeLa cells were cultured in Dulbecco's modified Eagle's medium 
(DMEM) (GIBCO BRL, Gaithersburg, MD) supplemented with 10% fetal calf serum 
(FCS) (GIBCO BRL). HeLa cells, plated on glass coverslips, were transfected by the 
standard calcium phosphate method. Following overnight incubation with DNA, the 
cells were washed and fresh medium was added to the cells. Transfected cells were 
fixed and processed for immunofluorescence staining after an additional incubation 
period of approximately 24-36 hours. 293T cells were plated on 100 mm plastic dishes 
and transfected as described above for HeLa. At approximately 36-48 hours 
post-transfection, the cells were collected and processed for immunoprecipitation assay 
as described herein. 

Immunoprecipitation and Western blotting 

Immunoprecipitations were carried out using total cell lysate prepared in 
the presence of 0.5% Triton-X-100 as described previously (Piñol-Roma et al., 
Genes & Dev. 2:215-227). For immunoblotting, proteins were resolved on a 12.5% 
SDS-polyacrylamide gel and transferred to nitrocellulose membrane (Schleicher & 
Schuell, Inc., Keene, NH) using a Bio Trans Model B Transblot apparatus (Gelman 
Science) according to the instructions of the manufacturer. The nitrocellulose filters 
were incubated in blotting solution (PBS with 5% nonfat milk) for at least 1 hour at 
room temperature, rinsed with cold PBS, and then incubated with primary antibody for 
at least 1 hour at room temperature. The filters were washed three times in PBS 
containing 0.1% Tween 20 and bound antibodies were detected using the peroxidase- 
conjugated goat anti-mouse IgG + IgM (Jackson ImmunoResearch Laboratories, Bar 
Harbor, ME). The protein bands were visualized by an ECL Western blotting detection 
kit (Amersham, Arlington Heights, IL) after washing three times in PBS containing 
0.1% Tween 20. The antibodies used for these experiments were as follows: mouse 
monoclonal anti-SMN (2B1, described previously herein); mouse monoclonal anti- 
SIP1 (2E17, described previously herein); mouse monoclonal anti-Sm (Y12, described 
in Lerner et al., 1981, Proc. Natl. Acad. Sci. USA 78:2737-2741); and mouse 
monoclonal anti-myc (9E10, ATCC, Manassas, VA). 

Immunofluorescence microscopy 

Immunofluorescence microscopy was carried out essentially as 
described by Choi and Dreyfuss (1984, J. Cell. Biol. 99:1997-2004). Briefly, primary 
monoclonal anti-myc antibody (9E10) was diluted 1:1000 in PBS containing 3% BSA. 
The secondary antibody was a goat anti-mouse IgG1 specific antibody conjugated to 
FITC. The incubations for both the primary and secondary antibodies were performed 
at room temperature for 1 hour. Laser confocal fluorescence microscopy was 
performed with a Leica TCS 4D (Germany) confocal microscope. 

The Results of the experiments presented in this example are now 
described. 

In vitro mapping of SMN interactions 
As previously discussed herein, the yeast two-hybrid screen 
demonstrated that SMN interacts with itself and with SIP1; further, in vitro binding 
experiments confirmed these to be direct interactions (Liu and Dreyfuss, 1996, EMBO 
J. 15:3555-3565; Liu et al., 1997, Cell 90:1013-102; Lorson et al., 1998, Nature Genet. 
19:63-66). To define the domains of SMN that mediate these interactions, various 
deletion mutants were constructed and these mutants and their interactions were 
examined by binding assays with purified recombinant GST chimeric fusion proteins 
of SMN, SIP1 and SmB. The in vitro translation products of SMN wild-type and 
mutant constructs, labeled with [35S]methionine, are shown in Figure [10A] 39. GST 
alone, used as a negative control, exhibited no detectable binding. In contrast, GST- 
SIP1 bound efficiently to full length SMN, to all the SMN carboxyl-terminal deletion 
mutants tested, and to the amino-terminal deletion mutant of exon 1 (ΔN27), but not to 
a further amino-terminal deletion of exon 2 (ΔN92). These results confirm data 
previously disclosed herein of binding competition experiments and further identify the 
highly conserved region within exon 2A as essential for SMN interaction with SIP1 
(Figure [15] 48-49). Carboxyl-terminal deletions, including the exon 7 deletion 
(ΔC16) and a point mutation (Y272C) found in severe SMA type I patients, abolished 
or severely impaired binding to wild type SMN whereas amino-terminal deletions had 
no effect. Although there was a small amount of SMN and SIP1 in the reticulocyte 
lysate, bridging by endogenous SMN was unlikely as SMNΔN92, which may interact 
with SMN, did not bind SIP1 and all of the carboxyl-terminus deletions, which did not 
interact with SMN, bound SIP1. Therefore the binding assays disclosed above very 
likely represent direct interactions between the in vitro translated proteins and the GST- 
chimeric fusion proteins. 

As previously disclosed herein, SMN is found in a complex with SIP1 
and together they function in the assembly of the Sm proteins onto snRNAs. Thus, the 
Sm proteins can be thought of as substrates for SMN and SIP1. Therefore, the 
interaction of SMN with purified recombinant GST-SmB was examined. Both deletion 
of exon 7 (ΔC16) and the SMA point mutation (Y272C) strongly affect the interaction 
of SMN with SmB. The binding was completely abolished by a further deletion of the 
highly conserved "YG box" region (Talbot et al., 1997, Hum. Mol. Genet. 6:497-500; 
ΔC26 and ΔC106). Deletion of exon 2 near the amino-terminus of the protein also 
abolishes the interaction with SmB. A summary of these results is presented in Figure 
[10B] 40 and Figure [15] 48-49. 
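
The reasoning used above to bracket the SIP1-binding region of SMN from 
terminal deletion mutants can be sketched in a few lines of code. This is an 
illustrative sketch only, not part of the disclosed methods. It assumes a 
full-length SMN of 294 residues (a value not stated in this passage), assumes that 
the carboxyl-terminal deletions tested against GST-SIP1 were ΔC16, ΔC26 and 
ΔC106, and uses hypothetical construct names and helper functions. 

```python
SMN_LENGTH = 294  # ASSUMPTION: length of human SMN in residues; not stated in this passage


def retained(n_del=0, c_del=0, length=SMN_LENGTH):
    """Residue interval (1-based, inclusive) left after N- and C-terminal deletions."""
    return (n_del + 1, length - c_del)


# GST-SIP1 binding outcomes as summarized in the text (True = bound).
# ASSUMPTION: dC16, dC26 and dC106 stand for the C-terminal deletions tested.
constructs = {
    "wt": (retained(), True),
    "dN27": (retained(n_del=27), True),
    "dN92": (retained(n_del=92), False),
    "dC16": (retained(c_del=16), True),
    "dC26": (retained(c_del=26), True),
    "dC106": (retained(c_del=106), True),
}


def common_region(table):
    """Intersection of the intervals retained by every construct that still binds."""
    starts = [interval[0] for interval, binds in table.values() if binds]
    ends = [interval[1] for interval, binds in table.values() if binds]
    return (max(starts), min(ends))


def lost_from_n_terminus(nonbinder_interval, region):
    """Part of `region` that an N-terminally deleted, non-binding construct lacks."""
    region_start, region_end = region
    nonbinder_start, _ = nonbinder_interval
    return (region_start, min(region_end, nonbinder_start - 1))


region = common_region(constructs)                                   # (28, 188)
critical = lost_from_n_terminus(constructs["dN92"][0], region)       # (28, 92)
print("Region shared by all binders:", region)
print("Residues lost in the non-binding dN92 mutant:", critical)
```

Under these assumptions, the binders share residues 28-188, and the non-binding 
ΔN92 mutant lacks residues 28-92 of that shared region, which is consistent with the 
conclusion above that sequences encoded by exon 2 are needed for the interaction with 
SIP1. 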

The dimerization and Sm binding domains of SMN are not identical 
The data disclosed above demonstrate that the SMN dimerization and substrate binding 
domains include the carboxyl-terminus of the protein and that these domains may be 
overlapping. The precise relationship of the two domains relative to each other was 
examined by looking for the minimal SMN region required to bind both SMN itself 
and SmB. To do so, several constructs were generated containing different portions of 
SMN fused to the maltose binding protein (MBP) and the binding of the in vitro 
translated fusions to GST-SMN and to GST-SmB was assayed (Figure [11B] 42). 
MBP itself did not bind either GST-SMN or GST-SmB, and MBP-SMN did not bind to 
GST alone but bound efficiently to GST-SMN and to GST-SmB (Figure [11A] 41). 
MBP fused to SMN lacking both the first 92 amino acids at the amino-terminus and the 
last 26 at the carboxyl-terminus of SMN (ΔN92/C26) did not bind SMN or SmB, 
further demonstrating the requirement of these domains for the interactions. Both 
MBP fusions with the carboxyl-terminus region of SMN which contain the conserved 
YG box (ΔN209 and ΔN251) bound efficiently to GST-SmB but not to GST-SMN. 
This suggests that the SMN dimerization and SmB binding domains are not identical. 
Moreover, these data identify the highly conserved region containing the YG box as 
necessary and sufficient to mediate SMN interaction with SmB but not sufficient for 
the SMN oligomerization. This conclusion is in agreement with data previously 
disclosed herein demonstrating that a peptide corresponding to this highly conserved 
region can efficiently compete with the interaction between SMN and SmB. 

SMN dimerization enhances the interaction with substrate 
Although the experiments just described demonstrate that there is a 
difference in the sequence requirements of the interaction of SMN with itself and with 
SmB, they do not reveal whether or not these interactions are mutually exclusive. The 
simplest interpretation of the data disclosed previously herein (i.e., peptide competition 
experiments) is that there is direct binding of SmB to SMN. To determine whether the 
interactions of SMN with SMN and SmB are mutually exclusive, binding experiments 
were performed and the results are depicted in Figure [12] 43. GST-SMN, or GST 
alone as a control, was first pre-incubated with or without a molar excess of purified 
recombinant His6-tagged SMN under binding conditions for 2 hours. The unbound 
His6-tagged-SMN was removed by extensive washing, after which in vitro translated 
[35S]methionine-labeled SMN, SmB or SIP1 was added to the beads and the binding 
assay was performed as described previously herein. SMN binding was only slightly 
decreased by the pre-incubation with His6-SMN, thereby demonstrating that not all the 
binding sites were saturated and that SMN can oligomerize. However, the pre- 
incubation with His6-tagged-SMN had a striking effect on SmB binding as almost 
100% of the input SmB was now in the bound fraction. The specificity of this binding 
was further demonstrated by the lack of an effect on SIP1 binding and by the low 
background obtained using GST alone. These results demonstrate that the binding of 
SMN to itself and to SmB is not mutually exclusive but, on the contrary, that SMN 
oligomerization greatly enhances the affinity of SMN for SmB. 

SIP1 interaction domain with SMN 

The amino-terminal region of SMN is required for interaction with SIP1 
(Figure [10] 39-40 and data previously disclosed herein). Using deletion mutants of 
SIP1 (Figure [13B] 45), the regions of SIP1 that are required for its interaction with 
SMN were mapped. In vitro translated SIP1 bound efficiently to GST-SMN but not to 
GST alone, or to GST-SmB or GST-SIP1 (Figure [13A] 44). Both carboxy-terminal 
deletions of SIP1 (ΔC89 and ΔC162), but not the amino-terminal one (ΔN71), 
completely abolished the binding of SIP1 to SMN. These results demonstrate that SIP1 
does not dimerize and also that SIP1 does not bind directly to SmB; instead, SIP1 
interacts directly through its carboxyl-terminus with the amino-terminal region of 
SMN. A schematic summary of these interactions is presented in Figure [13B] 45 and 
Figure [15] 48-49. 

SMN complexes in vivo 

As previously disclosed herein, in vivo, SMN is found in a complex that 
contains SIP1 and Sm proteins as well as several as yet unknown proteins. The SMN- 
containing complexes were studied in vivo by co-immunoprecipitation experiments 
after transfections of 293T cells with wild-type or mutant SMN and SIP1. First, 
whether a SIP1 mutant that does not interact in vitro with SMN can assemble into the 
SMN complex was assessed. To this end, myc-tagged SIP1 wild type, but not the 
ΔC89 deletion mutant, was co-immunoprecipitated with the anti-SMN monoclonal 
antibody 2B1, as expected if other interactions, in addition to the one with SMN, did 
not bridge the mutant to the complex (Figure [14A] 46A). Both myc-SMN wild-type 
and myc-SMNΔN92 were co-immunoprecipitated with the anti-SIP1 antibody 2E17 
(Figure [14B] 46B). This demonstrates that a SMN deletion mutant (ΔN92), which did 
not interact with SIP1 and with Sm proteins, was incorporated into the SMN complex 
that also contained SIP1. This may be due to the fact that the mutant can still interact 
with endogenous SMN bound to SIP1. Thus, SMN in vivo is in an oligomeric, at least 
dimeric, complex that also contains SIP1. Next, myc-SMN wild type and a carboxyl- 
terminal deletion mutant (ΔC26) lacking the YG box, which bound SIP1 but did not bind 
SMN and SmB in vitro (Figure [10] 39-40), were compared for their ability to 
associate with the SMN-SIP1 complex. SMNΔC26 was co-immunoprecipitated with 
anti-SIP1 antibodies as efficiently as the wild-type SMN, suggesting that it was 
assembled into the SMN/SIP1 complex (Figure [14C] 47). When the 
immunoprecipitation was performed with the anti-Sm antibody Y12, approximately 
one third of the SMNΔC26 mutant was co-immunoprecipitated compared with the wild- 
type SMN (Figure [14C] 47). The SMN Y272C point mutant and SMNΔC16 deletion 
mutant displayed a similar but reduced effect consistent with the lower effect these 
mutants demonstrated in the in vitro binding experiments. These results demonstrate 
that the SMN/SIP1 complex containing the SMN deletion mutant could still assemble 
with the Sm core proteins albeit less efficiently than wild type SMN. 

Subcellular localization of SMN and SIPl mutants 
The expression and cellular localization of the myc-tagged SMN and 
SIP1 mutants were determined by indirect immunofluorescence using anti-myc 
antibodies. As expected, SMN wild-type localized to gems. All the carboxyl-terminal 
deletion mutants of SMN and the point mutant Y272C also localized to the gems 
(Figure [16A-E] 50A-E). However, the SMN mutants exhibited stronger 
nucleoplasmic staining than wild-type SMN, likely reflecting their less efficient 
assembly into gems, which appeared smaller and less bright. Surprisingly, both amino- 
terminal deletion mutants of SMN (ΔN27 and ΔN92) had a strikingly different 
localization. In cells transfected with these mutants, SMN-containing accumulations 
were detected in the cytoplasm, and in the nucleus the gems became much bigger and 
more numerous (Figure [16F] 50F and [16G] 50G). All these structures which 
contained SMN mutants co-localized with endogenous SMN and SIP1. The amino- 
terminal deletions of SMN thus acted as dominant negative mutants, and further 
detailed analysis of their effects is presented below. The only SMN mutant that did not 
co-localize with gems is ΔN92/C106, which could not interact with SMN, SIP1 and Sm 
(Figure [16H] 50H). All the other SMN mutants still retained at least one interaction 
domain, suggesting that the assembly into the SMN/SIP1 complex in vivo is necessary 
and sufficient to localize the protein into gems. This suggestion is further supported by 
observations on the subcellular localization of SIP1 mutants shown in Figure [17] 51. 
Myc-tagged wild-type SIP1 transfected into HeLa cells localizes to gems (Figure [17A] 
51A). The amino terminal deletion mutant of SIP1 that still bound SMN in vitro 
(ΔN71, Figure [13] 44-45) localized to gems whereas a carboxyl-terminal deletion that 
did not bind SMN in vitro and did not assemble into SMN/SIP1 complex in vivo 
(ΔC89, Figure [13] 44-45 and [14] 46-47, respectively) did not localize to gems. 
Interestingly, a further carboxy-terminus deletion of SIP1 (ΔC162) exhibited a nuclear 
localization in structures resembling speckles (Huang and Spector, 1992, Proc. Natl. 
Acad. Sci. USA 89:305-308). This SIP1 mutant also acted as dominant negative as 
endogenous SMN and SIP1 became co-localized to these structures, and gems were 
disassembled. 

SMN regions necessary and sufficient for targeting to gems 

As SMN and SIP1, both wild type and mutants, localized to gems if 
they interacted with any other components of the complex, the various regions of SMN 
were tested for their capacity to target a reporter protein to gems. To do so, transient 
transfections into HeLa cells were performed using constructs expressing SMN regions 
fused to myc-tagged pyruvate kinase (myc-PK) as the reporter protein. Pyruvate 
kinase is normally cytoplasmic and is often utilized as a reporter to identify nuclear 
localization signals (see Siomi and Dreyfuss, 1995, J. Cell Biol. 129:551-560). Indirect 
immunofluorescence was used with anti-myc antibodies. It was observed that myc-PK 
fused to full length SMN localized to gems in addition to localizing to the cytoplasm 
(Figure [18A] 52A). A deletion mutant of SMN lacking the first 92 amino acids, 
which affected SMN interactions with SIP1 and Sm proteins but not with SMN itself, 
was sufficient to target PK to gems (Figure [18B] 52B). A deletion mutant lacking the 
last 106 amino acids, which affected SMN interaction with itself and Sm proteins but 
not with SIP1, was also sufficient to target PK to gems (Figure [18C] 52C). However, 
a mutant missing the middle part of SMN, which domain is present in and common to 
the two previous fusions, and which lacked the ability to interact directly with SMN, 
SIP1 and Sm proteins, was completely cytoplasmic (Figure [18D] 52D). Although a 
sequence resembling a classical nuclear localization signal was present in SMN exon 2, 
no nucleoplasmic staining was detected, suggesting the absence of a functional nuclear 
localization signal in SMN. Thus, SMN (and possibly SIP1) nuclear import apparently 
occurs when the protein is a component of a complex including snRNPs. 

The disclosures of each and every patent, patent application, and 
publication cited herein are hereby incorporated herein by reference in their entirety. 

While the invention has been disclosed with reference to specific 
embodiments, it is apparent that other embodiments and variations of this invention 
may be devised by others skilled in the art without departing from the true spirit and 
scope of the invention. The appended claims are intended to be construed to include all 
such embodiments and equivalent variations. 






ABSTRACT 

The invention relates to an isolated nucleic acid encoding a eukaryotic 
Survival of Motor Neuron-Interacting Protein 1 (SIP1), compositions comprising SIP1, 
and SIP1 and the spinal muscular atrophy (SMA) disease gene product Survival of 
Motor Neuron protein (SMN), and diagnostic and therapeutic assays directed to SMA. 
The invention also relates to another protein that specifically interacts with SMN and is 
a component of gems, designated Gemin3, and the nucleic acid encoding the protein. 
Additionally, the invention relates to a novel cell line wherein the endogenous SMN 
genes have been deleted and where an exogenous nucleic acid encoding SMN has been 
inserted into the cell such that expression of SMN in the cell is under the control of an 
inducible promoter. This novel cell line provides a stable genetic system for the study 
of SMA and for the development of SMA therapeutics. 







FIG. 1 

FIG. 2 

FIG. 3 

FIG. 4 

[Legible drawing labels: GST; GST-SMN] 

FIG. 5 

FIG. 8A 

FIG. 8B 

FIG. 9B 

FIG. 10A 

FIG. 10B 

[Legible drawing labels: SmB; GST-SMNY272C; His-SMN] 

FIG. 12A 

FIG. 12B 

FIG. 13B 

FIG. 14 

FIG. 15B 

FIG. 16 

FIG. 17 

FIG. 18 

FIG. 20 

FIG. 21 

FIG. 22 

FIG. 23 

[Legible drawing labels: SMN; SIP1 (Gemin2); Sm B/B'; Sm D1-3; Sm E, F, G] 

FIG. 24 

FIG. 31C 

[Legible drawing labels: In vitro translation; GST-Gemin3; GST] 

FIG. 31D 

FIG. 31F 

FIG. 32A 

[Legible drawing labels: In vitro translation; GST-SMN; GST] 

FIG. 32B 

FIG. 33B 

FIG. 34A 

FIG. 34B 



QSTTTC 

CA<jAACACrrCCACCATrCITC3TCTrGCTGGCCCCC^^ 

CACCGTTGATTCCTCCTCCACCACCrATGGGGCCAGATTCTCCTCS^CKJA^^^^ 

GAAGCGTTGQGGAGCATGTTGATAGCTTGGTATATGAGTGGTTATCACACTG 

GATATTACCTGGGGTTAAAACAAAGTCGAATGGAAGCAGCCCTAGAGAGAG 
AMjCCTATCTAAAATAG 



FIG. 35B 



hm ASG: mk"mh am ms B«a»vQv pap bpt 



50 



«3P vm pes AOD tss eim- 

tOff 



RTS DVL lyffi: KMDT FES LLL,.saP VLB SLR AAG FE^ esp 



KJU:.. KBS iSCC LOL 



■mm 



siK iffiK ffi^F^.^'.-m. scp JOTi r« cm isvs sps aiK-.53i^-^^ 

■' • ■ • 2-SO- 

IRL FII. DEA. DKL CBS CS5F QBQ INW jys SIJ^ A3K *VS «a» OT?^ ^ftir SKT 

330 

Km RDP TPV; MM. -SSOPSI, IGl. KQ¥ YKVVSiS YFl.^ AHE. EKT <»c2 ^ «^ BSa 

IPF LVr Sm. HSE AQH LAD XLS SKC FPA ECI SGH «»Q NQE MI^ LKH 

400 

FHC RVL 1ST DLT SRG IDA EKV NLV VNL 0VP LDW CTY MHR XGH IgH FCT UGL 



450 



TVT YCC KGE BOI RXA QKC lOH LLP LFB FXF SGL HEB CVB WDV EVK AAV 



500 



HTY GIA- SVP ITOF tKK Qlft KlE KTh QIQ KAH CTH MftS s«ir WOT aSI, ^ 



550 



OTK ego. PVK. ©X-lIE SPK SLO . CUE <2SE- ^ KNS VOT PVE: 



600 
* 



l«3V KBA LPV SLP QIP CLS SPK MQ POT LTF ABL VSD YBH YIK EGL BKP VBI 



650 



IHH. YTG PSD SPQ van KVX BOK VPV MS, SSQ SGS SBS 



DSD srs 



700
* 



SSQ NKS ^ SSS MO LK0 TFV DDR ISL BQP EMG TDT I«P IKY QES 

FIG. 36A



^ TBL sm A3Q Qssi ^ wm SBW rlq -tm 



Qm Om DCH REX RLS 



800 



FSD tt<i OYB B3ro HAY YKJI HSif mt* ^..ms 



^^mi AYH Mm» lYL 



■ % 



FIG. 36B 



so: 



ATO GCG GCG GCA. CTT GCC TCG GGA OCC TOA. GGA GCA STG GCG ACT GCT 

' ATG CCG GCT oaG QqPG <SCC G1?G CAG GTC CC© GCC COl G^?tG CCA AC31 CCC 

<^.- CCT. l^tTC; AQm>.mC'.'a^-CQQ: '^OZ GCT GAS^ <^T: eTC-uSQC 'iMSC: CCG 'CGCS. ^s^CC 

OSC ©GG GASP- GTS CTG. TBS GCQ <»G CCG SKC GMC TO: SAG TCA CSG CT6 

250 

COT TCG CGG CCG GTO CTG GAG GGG CTG CGG (SCQ GCC CKSC TTC GAG AG6 CCC 

300 
* 

TCG. CXG GTG CAG CTC AAG GCC ATC CCG TTG- <3m- CGC TGC GGG CTC GAT TTA 

350 

ATT GOT CAA GCT AAA TCT GGC ACC GGG AAA ACC TGT GTG- TTC TCC ACC ATA 

400

GCT TTG GAC TCT CTT GTT CTT GAA AAC TTA AGT ACC CAG ATT TTQ ATC TTG 

450 

GCT CCT ACA AGA GAA ATT GCT GTA CAG ATA CAT TCT GTT ATT ACA GCC ATT 

500 

GGA ATA AAA ATG GAA GGC TTA GAG TGT CAT mC' TTT ATT GGA GGG ACC CCA 

550 

TTA TCA GAA GAC AAA ACC AGA CTT AAA AAG TGT CAT ATT GCT GTT GGA TCT

600 

CCT GGC AGA ATT AAG CAA CTC ATA GAA CTT GAC TAC TTG AAG CCA GGC AGT 

650

ATA CGC CTC TTT ATT GTT GAT GAA GCA GAT AAG CTT TTA GAA GAA. OSC AGC 

700 

TTC €^ GAG CAA, ATA AAT TGG ATT TAT TCT TCC TTG CCT GCC AGT AAA CAG 

750

FIG. 36C 



A3*G CTG GC& OTA TCA OCT ACT TOT rrv nit^ *Twm mm«, ^ 

xw* lAX CCC GftA TTT TTG GCT AAT <SCT TOG ACA 

A^f^ TftC ^<5a OAT CCC iU-T TCT G1:a CTG aIi? TCC AG^ Q»r CCA AGT 

850

cri^: «^ ^^"iro J^^S'^^ gca 

CAT hm TTT GAG GAA. Aft© ACT CAG CAT TFA CftG. <3AA-..C^ T.TC AGC' AGA 

950 

ATT CCA TTT AAT CAA GCT TTA GTC TTT TCT AAT TTO CAC AGC AGA GCA CAA < 

1000 

CM" TTG <K=r iL^ CTT TCT TCT AAA G<X m 

1050

*

<^ AAT ^:.AAT GfiG AAT CAG CGT CTT GAT GCT ATG GCT AAA CTG AAG CAC 

1100

TTT CAT TGC AGA GTC CTC ATT TCC ACA GAT TTG ACT TCT CGT GGG ATT GAT 

1150 

GCT (SAG AAG GTG AAT CTG GTT GTA AAT CTG GAT GTA CCA TTG GAT TGG GAG 

1200 

* 

m:k tag atg cat coer att obg aga gct gcsc cot ttt ggt aca ttg. csgg ctg. 

1250 

ACA GTG ACC TAC TGT TGC CGG GGA GAG GAA GAA AAT ATG ATG ATG AGA ATT

1300

GCC cm- AAA TGT AAT ATC AAC CTT CTC CCT TTA CCA GAT CCC ATT CCT TCT. 

1350 

GGT cm- ATG GAA. SAA TGT GTS GAT TGG GAT GTG GAA GTT AAA GCP GCT GTG 

1400

CAT ACa TAT GST ATA GCA, ACT GTS CCT AAC GAA CCC TTA A»A AAQ CAA ATT 

1450 

FIG. 36D 



OtfS AAA ATA. GAG. AGA ACC CTT CAA A-PP fflw- a.** 

AST CMS AAA GCr CAJP OCT GAC CAC ATC: 

1500

GCT TCC TCT i«3A AAT AAT ^ tCT CTA TCA GTC AAA TCA AAA: AAT 

1550

""tsm. ■% ' 

GAA AAA GCA ACGr.TCa: CCA AAA <3AA CTG- GGC TOT <mc ASG-.CAA. TQC GAA OJ^. 
CAAATSAAGAATICTSTTCAGACTCCCGTTGAAAACTCCACCAACaGrCAG 



1700 

CAC CAG GTC AAA GAA SCT^ TTA CCT GTG TCA CTC CCC C^, ATT CCT TOOT e5« 

1750
*■ 

TGT TCC rrr AAA ATC CAT CAG CCA TAC ACG TO3 ACT TOT OCT GAA TTG GTA. 
1800 

GAG GAT TAT GAA CAT TAT ATT AAA GAG GGG TTA GAG AAA CCT GTG GAA ATC
1850

ATC AGG- CAC TAC ACA GGC CCT GGG GAT CAG ACT GTG AAT CCT CAA AAT GGT 

1900

TTT GTG AGA AAT AAA GTT ATT GAA CAG AAA. GTC CCT GTG TTG GCA AGT-W 
1950

mC CAA TCT- GGA cac TGT: Om A^T GAC AGT GAT TCT TAC AGC TCA AGJ^. ACC 



2000



TCT TCC CAG AGC AAA GGA AAT AAfi TCA TAC TTG GAA. AGC TCT TCT GAT AAT 
2050 

CAG cm AAA GAC TCT GAA TCT ACO CCT GTG GAT GAT CGT ATT TCT TTG GAA 
2100 

CAA CCA CCA AAT GGA ACT GAC ACC CCC AAT CCA GAG AAA TAT CAA GAA TCA 
2150

COT GGA ATC CAC ATG AAG ACA AGA CUT AAA GAG GGG GCT AGC CAG AGA GCT 

FIG. 36E



■ ■ ejidi/^^ €Bm mc 'cm ear- ^ cog tct tcc ttc agjl ttg cag act- 

* (^ &^:,^, G2VA <3&1f (3&m T^. TOT CAT ^M^/^ M^- CTG^ AGfF 

-m? TCT am sec tat CAG G^T tat SIG gag TAC TSOS'-Jmt GCT W: TAC AfiC 

2350

GCft TOG. CAR GAA TAT TAT OCT GCG GCT TCT CAT TCA Hm- TAT WSG AAT QCT 
2400 

CAG ACA CAT CCA AGT TGG ATG GCA OCT TAT CAC ATG AAT ACC ATT TAT CTA 
2450 

CAA GAA Am M¥© CTT AST AAC CAS TOS TTA TAG GAT ATA CCT (SSUS ACC ATC 

AGG AAC TGT CAA CAA ATG ATA CCT TTQ GAT ATC CAT CCT CCT CGA CTT ATA 

2600 

GTA CAG TGG TGT ATA GTG GCA TOT CTG ATA AAC TTG AAA AGA CTT GGA TCT 

2650

*

TTC CAC TGG GAC ACA TCC ATT TTT CAG ATT GTT TTG ATT TAG GCC AGG TAT 

ATT ATC TTC ATT TTT AAG AC3T TTC TTT AAG AAA CCT CAT CAG AGT GTT 

2750

AGC ATC ACT TTC TGG rave CAT AGA TOO TGA. CAG TTT CAG GGT GCC ATT GTC 

2800 

CAT AAG ATC TTC CCA AAC GAT ACA GTT 6AA GCG AGG ACA TAT ACC TCC ACT 

2850 

TAC CTA GCT «CG ATA AAA GCA GTA GAC TTG GTT AGT AAA AAA AAA AAA AAA 

AAA. 



FIG. 36F 



61 gcgctgtgac ctagaatggg cgcatgcgcc gagcggaact ggctggtttg aaaaccatgg
121 cgtgggtacc agcggagtcc gcagtggaag agttgatgcc tcggctattg ccggtagagc
181 cttgcgactt gacggaaggt ttcgatccct cggtaccccc gaggacgcct caggaatacc
241 tgaggcgggt ccagatcgaa gcagctcaat gtccagatgt tgtggtagct caaattgacc
301 caaagaagtt gaaaaggaag caaagtgtga atatttctct ttcaggatgc caacccgccc
361 ctgaaggtta ttccccaaca cttcaatggc aacagcaaca agtggcacag ttttcaactg
421 ttcgacagaa tgtgaacaaa catagaagtc actggaaatc acaacagttg gatagtaatg
481 tgacaatgcc aaaatctgaa gatgaagaag gctggaagaa attttgtctg ggtgaaaagt
541 tatgtgctga cggggctgtt ggaccagcca caaatgaaag tcctggaata gattatgtac
601 aaattggttt tcctcccttg cttagtattg ttagcagaat gaatcaggca acagtaacta
661 gtgtcttgga atatctgagt aattggtttg gagaaagaga ctttactcca gaattgggaa
721 gatggcttta tgctttattg gcttgtcttg aaaagccttt gttacctgag gctcattcac
781 tgattcggca gcttgcaaga aggtgctctg aagtgaggct cttagtggat agcaaagatg
841 atgagagggt tcctgctttg aatttattaa tctgcttggt tagcaggtat tttgaccaac
901 gtgatttagc tgatgagcca tcttgatgta gctgatctct cagggataga agatatttct
961 catgaaggca gcctaactct gaggaaaaca atgccaattc aagtacagat ttcaacacat
1021 cttcaacact atgtgaaggg ttcacatctt aacctgtgca attcagattg atactcagaa
1081 tatgggttga tttgaatatc tgaaatatca atggaaaatc ccactcagtt tttgatgaac
1141 agtttgaaca gttttctgta atcaagcagc ttgcatagaa attgtatgat gaaattttac
1201 ataggttctt ggtgctgttt tgttcttttt ttgttttttg ttgttttgtt atttacttat
1261 atacatataa aattttattg aaaat



1 gaattcggca cgagcggggc ccgaagagct gatgcccagg ctgttaccgg ttgaggcctg
61 tgatcttccc gaggactatg atccctccgt gccgcctcgg acccctcagg agtatctgcg
121 gagagtccag attgaagcag cacgttgtcc tgatgtagtc attgcacaga ttgatcccaa
181 aaagttgcga aagaaacaga ccgttagcat atctctgtcg ggatgccagc ctgctcctga
241 tgggtactct ccaagcctcc gctggcagca gcaacaagta gcacagtttt ctgctgtccg
301 ccagagtctg cacaagcaca ggggtcactg gaggtctcag cctttggaca gcaatgttac
361 aatgccaagc acagaggatg aagagagctg gaaaaagttc tgtctggggg aacggctata
421 ttctgaccta gcagctgccc taaacagcga gagccagcat ccaggaattg attacattaa
481 ggttggtttc ccaccgttgc tgagcattgt tagtcggatg agccaggcga cagtaacaag
541 tgtgctagaa tacttggtga actggtttga agagaggaac tttactccag agctgggtcg
601 ttggctttat gctttgctgg cctgcctgga gaaaccactg ctgcctgagg ctcactctct
661 tattaggcag ttggcacgaa gatgctcaca aatcagagct ggggtggaac ataaggaaga
721 tgatcgggtg tctccactga acttattcat ctgtctggtt ggcaggtact ttgaacagcg
781 agatttggct gactgtggtg acccatcttg atgatgatca ggcagcttta ccccccctcc
841 cccactctcc cagagcatct cggcaatatc catgctatcc actccccttc tcatccagtg
901 gtgcaccaac tatatcgttt ttggattcag gaaactgtgt ggtttaaccc tctcagtgcc
961 aaaaagggcc ttgaaggagc taggacaggc atggataatc tctanccttc agatgtttaa
1021 ctacaactac caagaagccg aanaagttgc agttgaaccg catctacagc acttcacttt
1081 gccaatccct gaattttggc accgaccaat tgcacgtcna cctcttgcct gccattggca
1141 ntatantata atgttttccc tttcttggga atctgaanga acaacngtct tatttattgt
1201 tccgttctnt nnnnngttnn ggtntnnnnn ttttcnantc ntttttccat atattggccn
1261 aanttgggaa aaaatattaa nttgcctcnn tgggtttgtt ggaaaaccat tttccnttcc
1321 ttaaaacccc cccctgctgt ttacccctcc ttttgcccnt ttgttcnaca anctgggaaa
1381 aaaattcttt aaatccnttc tnttnccntn gggaaaaccn nttaaactnt thcnnttnaa
1441 aantattttt tttgcccttt aantttgnan tgttcccccc cccccnnttt gcctnttttn
1501 cncctttttn tnaaacnncc ccngttnntn tnggtcnncc cccnnnnccc nngggaancc
1561 nncntntttt tnaaaanncc ttncccccnt cccntnnttn ngtccnnaat ttnnnnnnnn
1621 naaaccnntt t



FIG. 38



